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(Dated: February 28, 2011) 



The collection of a large number of B hadron decays to hadronic final states at the CDF 11 detector 
is possible due to the presence of a trigger that selects events based on track impact parameters. 
However, the nature of the selection requirements of the trigger introduces a large bias in the 
observed proper decay time distribution. A hfetime measurement must correct for this bias and the 
conventional approach has been to use a Monte Carlo simulation. The leading sources of systematic 
uncertainty in the conventional approach are due to differences between the data and the Monte 
Carlo simulation. In this paper we present an analytic method for bias correction without using 
simulation, thereby removing any uncertainty due to the differences between data and simulation. 
This method is presented in the form of a measurement of the lifetime of the B~ using the mode 
B~ — >■ D^tt". The B~ lifetime is measured as Tg- = 1.663 ± 0.023 ± 0.015 ps, where the first 
uncertainty is statistical and the second systematic. This new method results in a smaller systematic 
uncertainty in comparison to methods that use simulation to correct for the trigger bias. 

PACS numbers; 14.40Nd;13.25.Hw;29.85.Fj 



I. INTRODUCTION 



The weak decay of quarks depends on funda- 
mental parameters of the standard model, includ- 
ing the Cabibbo-Kobayashi-Maskawa (CKM) 
matrix, which describes mixing between quark 
families [11, Extraction of these parame- 

ters from weak decays is complicated because 
the quarks are confined within color-singlet 
hadrons as described by quantum chromodynam- 
ics (QCD). An essential tool used in this extrac- 
tion is the heavy quark expansion (HQE) tech- 
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niquc 0. In HQE the total decay width of a 
heavy hadron is expressed as an expansion in in- 
verse powers of the heavy quark mass m^. At 
0{l/mb) the lifetimes of all B hadrons are iden- 
tical. Corrections to this simplification are given 
by 0{l/ml) and 0(1/™^) calculations leading 
to the predicted lifetime hierarchy: t{B^) > 
t{B°) w t(B°) > T{Ab) > t{Bc) and quantita- 
tive predictions of the lifetime ratios with respect 
to the B° meson [i-Q. 

The Tevatron pp collider at y/s = 1.96 TeV has 
the energy to produce aU B hadron species. The 
decays of these hadrons are selected by a variety 
of successive trigger selection criteria applied at 
three trigger levels. Unique to the CDF II de- 
tector is the silicon vertex trigger (SVT), which 
selects events based on pairs of tracks displaced 
from the primary interaction point. This exploits 
the long lived nature of B hadrons and collects 
samples of B hadrons in several decay modes, tar- 
geting in particular the fully hadronic B decays. 
Many different measurements of the properties 
of B hadrons have been made using samples se- 
lected by this trigger, examples of which are given 
in Ref. [IMI- 

However, this trigger preferentially selects 
those events in which the decay time of the B 
hadron is long. This leads to a biased proper 
decay time distribution. The conventional ap- 
proach to correct this bias has been through the 
use of a full detector and trigger simulation. An 
important source of systematic uncertainty, in- 
herent in this conventional approach, is how well 
the simulation represents the data. A full and 
accurate simulation of data collected by this trig- 
ger is particularly difficult due to the dependence 
on many variables including particle kinemat- 
ics, beam-interaction positions, and the instanta- 
neous luminosity. The differences between data 
and simulation are the dominant systematic un- 
certainties in the recent CDF measurement of the 
Ah lifetime [l^. These systematic uncertainties 
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will be the limiting factor in obtaining precision 
measurements of b hadron lifetimes in data sam- 
ples collected by methods that introduce a time 
distribution bias. In this paper we present a new 
analytical technique for correction of the bias in- 
duced by such a trigger. This technique uses no 
information from simulations of the detector or 
physics processes, and thus incurs none of the 
uncertainties intrinsic to the simulation based 
method. 

The technique is presented in a measurement 
of the B~ meson lifetime using the decay mode 
B~ — >■ D°7r~ (charge conjugate decays are im- 
plied throughout). This decay channel is chosen 
as the lifetime of the B~ is already well known 
and the high yield available in this channel al- 
lows a good comparison to the world average. 
This measurement demonstrates the ability of 
this method to reduce the overall systematic un- 
certainty on a lifetime measurement. A displaced 
track trigger is expected to operate at the LHCb 
detector, and the technique of lifetime measure- 
ment presented here is applicable to any data 
where the method of collection induces a bias in 
the proper decay time distribution. 



II. OVERVIEW 

The simulation-independent method, pre- 
sented here, for removing the trigger-induced 
lifetime bias is based on using a candidate-by- 
candidate efficiency function for each B meson 
candidate. This efficiency function is calculated 
from the event data, without recourse to simu- 
lation. This approach is based on the observa- 
tion that for a given set of decay kinematics of 
the decay B~ — D°7r~ (i.e the four momenta 
of the final state particles and the flight distance 
of the D) the decay time dependent efficiency 
function has a simple shape that can easily be 
calculated from the measured decay kinematics 
and the known decay time dependent cuts. This 
provides a simple and robust method for taking 
into account the effect of the trigger by calculat- 
ing a different efficiency function for each candi- 
date and applying it, candidatc-by-candidatc, in 
a likelihood fit. The details of this calculation 
are presented in Sec. |Vl 

As discussed in Ref. [l^, if a candidate-by- 
candidate quantity (here, the efficiency function) 
enters a fit with a signal and background com- 
ponent, the probability density function (PDF) 
for this quantity needs to be included in the fit, 
unless it happens to be identical for both compo- 
nents. In our case, this constitutes a significant 
complication as it requires fitting a distribution 
of efficiency functions rather than just numbers. 



This is accomplished with an unusual application 
of the Fisher discriminant method to translate 
each efficiency function into a single number, de- 
scribed in Sec. I VIII 

While wc do not use any input from simula- 
tion in extracting the B lifetime from the data, 
wc do use simulated events to test our analysis 
method and also to evaluate systematic uncer- 
tainties. We use a full GEANxS-based detector 
simulation [l7| . (which includes a trigger simu- 
lation), as well as a detailed fast simulation for 
high-statistics studies. The results of the simula- 
tion studies are presented in Sec. lVIl and Sec. lIXl 
In Sec. IVIIII we show the results of applying the 
method to our data, and in Sec.|X]wc summarize 
our conclusions. A brief description of the rele- 
vant components of the CDF detector, in partic- 
ular, the trigger is given in Sec. IIIII followed by 
the description of the event reconstruction, data 
selection and sample composition in Sec. IIVI 



III. THE CDF II DETECTOR AND 
TRIGGER SELECTION 

This analysis uses data corresponding to 1 fb~^ 
of integrated luminosity collected by the CDF II 
detector at the Fermilab Tevatron using pp col- 
lisions at ^/s = 1.96 TeV. The data were col- 
lected during the first four years (2002-2006) of 
the ongoing Run-II data taking period. The CDF 
II detector is described in detail elsewhere [T8| . 
A brief description of the most relevant detector 
components for this analysis follows. 



A. CDF II Detector 

The CDF II detector has a cylindrical geom- 
etry with forward-backward symmetry. It in- 
cludes a tracking system in a 1.4 T magnetic 
field, coaxial with the beam. The tracking system 
is surrounded by calorimeters and muon detec- 
tion chambers. A cylindrical coordinate system, 
(r, 0, z) is used with origin at the geometric cen- 
ter of the detector, where r is the perpendicular 
distance from the beam, (j) is the azimuthal angle 
and the z direction is in the direction of the pro- 
ton beam. The polar angle 9 with respect to the 
proton beam defines the pseudorapidity rj which 
is given by r/ = — ln(tan |). 

The CDF II detector tracking system consists 
of an open cell argon-ethane gas drift chamber 
called the central outer tracker (COT) |19|. a sili- 
con vertex microstrip detector (SVX-II) [20], and 
an intermediate silicon layer detector (ISL) plj . 
The SVX-II is 96 cm long, with three sub-sections 
in z and has five concentric layers of double 
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sided silicon microstrip detectors from r=2.45 to 
r=10.60 cm segmented into 12 wedges in (j). The 
COT is 310 cm long, consisting of 96 sense wire 
layers grouped into eight alternating axial and 2° 
stereo superlayers. The ISL lies between a radius 
of 20.0 and 29.0 cm and helps in extending the 77 
coverage of the SVX-II and COT. Together the 
SVX-II, ISL, and COT provide r-cj) and z mea- 
surements in the pseudorapidity range | 77 |<2 or 
I ry |<1 for tracks traversing all eight COT super- 
layers. 



B. Track Parametrization 

A charged particle has a helical trajectory in 
a constant magnetic field. A description of the 
five parameters used to describe charged parti- 
cle tracks at the CDF experiment follows. In 
the transverse plane, which is the plane perpen- 
dicular to the beam direction and described by 
X and y coordinates, the helix is parametrized 
with track curvature C, impact parameter do, 
and azimuthal angle (f>o- The projection of the 
track helix onto the transverse plane is a circle 
of radius R, and the absolute value of the track 
curvature is | C |= The curvature is related 
to the magnitude of the track's transverse mo- 
mentum, pr, by I C 1= ^■id»QS-w-''-B ^ ^Yiem C 

is in cm~^, B is in Tesla and pr is in GeV/c, 
where c is the speed of light in vacuum. The sign 
of the curvature matches the sign of the track 
charge. The absolute value of corresponds to 
the distance of closest approach of the track to 
the beam line. The sign of da is taken to be 
that oi (p X d) ■ where p is the unit vector in 
the direction of the particle trajectory, d is the 
direction of the vector from the primary interac- 
tion point to the point of closest approach to the 
beam, and z is the unit vector in the direction of 
increasing z. The angle (jjo is the azimuthal angle 
between x and the particle momentum at closest 
approach. The two remaining parameters that 
uniquely define the helix in three dimensions are 
the cotangent of the angle 9 between the z axis 
and the momentum of the particle and zq, the 
position along the z axis at the point of closest 
approach to the beam. 



C. Trigger Selection 

The CDF II detector hadronic B trigger is at 
the heart of this analysis. It collects large quan- 
tities of hadronic B decays, but biases the mea- 
sured proper decay time distribution through its 
impact-parameter-based selection. The CDF II 



detector has a three level trigger system. The 
first two levels, level 1 (LI) and level 2 (L2), are 
implemented in hardware and the third, level 3 
(L3), is implemented in software on a cluster of 
computers using reconstruction algorithms sim- 
ilar to those used offline. The CDF trigger has 
many different configurations of selection require- 
ments designed to retain specific physics signa- 
tures. In this paper we refer to the family of trig- 
gers aimed at collecting samples of multi-body 
hadronic B decays as the "two track trigger" . 

At LI the trigger uses information from the 
extremely fast tracker (XFT) [2^ . It requires two 
tracks in the COT and imposes criteria on track 
Pt and openin g an gle. At L2 the silicon vertex 
trigger (SVT) [23, which uses silicon hits and 
fast pattern recognition, reapplies the pT criteria, 
associates silicon hits with each XFT track and 
requires that the absolute value of each track's 
do lies between 120 and 1000 fim. 

A determination of the beam collision point 
or primary vertex is continuously made by the 
SVT during each data taking period (defining a 
run) and is used by all relevant triggers. After 
data taking is complete, the offline algorithm uses 
full detector information and fully reconstructed 
three dimensional tracks for a more accurate de- 
termination. At L2 additional criteria are im- 
posed on variables calculated from each track 
pair found by the SVT. The variables are: the 
product of the track charges (opposite or same 
sign) , a track fit quantity, the opening angle of 
the two tracks in the transverse plane, the scalar 
sum of the px of the two tracks, and the L^y, 
where the L^y is the projection of the distance 
between the primary vertex and two track inter- 
section along the direction of the sum of the two 
track Pt- The L3 trigger uses a full reconstruc- 
tion of the event with all detector information, 
(although using a slightly simpler tracking algo- 
rithm than the one used offline) and reconfirms 
the criteria imposed by L2. In addition, the dif- 
ference in zq of the two tracks is required to be 
less than 5 cm removing events where the pair of 
tracks originate from different collisions within 
the same crossing of p and p bunches. The im- 
pact parameter for any given track measured by 
the L2 (SVT) is, in general, different from the 
impact parameter calculated by the L3 or offline 
reconstruction algorithms for the same track due 
to the differing algorithms. These different mea- 
surements of impact parameter are referred to in 
this paper as d};^, and dfbom L2 (SVT), 
L3, and offline algorithms, respectively. 

Three different two-track trigger configura- 
tions are used in this analysis. Their criteria are 
summarized in Table U in terms of the quantities 
described above. It is clear that the impact pa- 
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rameter and L^y requirements will preferentially 
select long-lived B hadron decays over prompt 
background. The three selections arc referred 
to as the \ow-pT, medium-py and high-p^ selec- 
tions. This is a reference to their single track pT 
(> 2.0, 2.0, 2.5 GeV/c, respectively) and track 
pair PT scalar sum (> 4.0, 5.5, 6.5 GeV/c, re- 
spectively) selection requirements. 

The requirements of the three trigger selec- 
tions mean that any event that passes the high- 
Pt selection, simultaneously satisfies the require- 
ments of the low and medium px selections. The 
three separate selection criteria exist because of 
the need to control the high trigger acceptance 
rates that occur at high instantaneous luminos- 
ity due to high track multiplicity. The rates 
are controlled by the application of prescaling, 
which is the random rejection of a predefined 
fraction (dependent on the instantaneous lumi- 
nosity) of events accepted by each trigger selec- 
tion. Therefore only the higher purity, but less 
efficient, high-p^ selection is available to accept 
events at higher luminosities. 

The SVT single track finding efficiency as a 
function of df , e{df), is an important factor 
in this analysis. There have been three improve- 
ments in the SVT efficiency over the course of the 
data taking time period used by this analysis due 
to changes in the pattern recognition algorithm. 
These have led to three consecutive time periods 
in which e{d'^) has improved. These three peri- 
ods and different resulting efficiencies are incor- 
porated into the analysis as described in Sec. |Vll 



IV. DATA SELECTION AND EVENT 
RECONSTRUCTION 

A. Reconstruction of the decay — > D^tv^ 

The reconstruction of the decay D'-*n^ 
uses data collected by the two track trigger de- 
scribed in Sec. IIII CI Standard track quality 
selection criteria are applied to all individual 
tracks: each track is required to have pr > 0.4 
GeV/c, I 77 I < 2, a minimum of five hits in at 
least two axial COT super layers, a minimum of 
five hits in at least two stereo COT super lay- 
ers and a minimum of three silicon hits in the 
SVX-II r-<j) layers. Candidate D° -> A'-7r+ or 
D'^ K^TT" are searched for first. As no par- 
ticle identification is used in this analysis, the 
search for D^[D'^) candidates considers all pairs 
of oppositely charged tracks which are then as- 
sumed to be K~ and 7r"'"(7r~ and K^) and as- 
signed the kaon and pion (pion and kaon) masses, 
respectively. The two tracks are then constrained 
to come from a common vertex and the invariant 



mass {mjjo) and pt{D^) are calculated. Can- 
didates are required to have a mass within 0.06 
GeV/c^ of the world average D° mass, 1.8645 
GeV/c2 [H, and pt{D^) > 2.4 GeV/c. The 
K~Tr~^ pair is required not to exceed a certain 
geometric separation in the detector. Defining 
the separation in the rj-cj) plane, in terms of 
the differences in 77 and (f) of the two tracks, 
as AR = ^y Arf + A(j>'^, we require AR < 2. 
The separation in zq of the two tracks is re- 
quired to be Azo < 5 cm. The candidate D° 
is then combined with each remaining negatively 
charged track with pT > 1 GeV/c in the event. 
These are assumed to be pions from the decay 
B~ D°Tr~. The D° and the it~ are con- 
strained to a common vertex assumed to be the 
decay point of the B^ with the Z?" mass con- 
strained to the world average. The three tracks 
can be combined to measure the invariant mass 
of the candidate B^ , ttib- 

Proper decay time calculations in this paper 
are made using distances measured in the plane 
transverse to the beam. The proper decay time 
of the B~ , t, is given by 



t = 



L 



xy 



c(/37), 



(1) 

CPT 



where L^y is the projection of the distance 
from the primary vertex to the B~ vertex along 
the direction of the transverse momentum of the 
B^ and {I3j)t = is the transverse Lorentz 
factor. The statistical uncertainty on L^y, c^L^y, 
is calculated from the full covariance matrix of 
the vertex constrained fit and is dominated by 
the primary vertex resolution which is approxi- 
mately 33 ^m. We have used the average beam 
position per run, which is calculated offiine for 
each run, as an estimate of the primary vertex 
position. The uncertainty on the proper decay 
time is calculated by transforming a^^ into the 
B rest frame. 

To reduce background we require that the B~ 
candidate must have: 5.23 < tob < 5.5 GeV/c^, 
< i < 10 ps, PT > 5.5 GcV/c, L^y > 350 ^m, 
that the impact parameter of the B with respect 
to the beam spot is smaller than 80 /im, and 
that at < 0.333 ps where at is the decay time 
uncertainty. We also require that the of the 
vertex constrained fit is less than 15, that all 
tracks have zq within 5 cm of each other, and 
that Ai?(£)0,7r") < 2. 

It is possible to reconstruct candidates where 
no pair of tracks in the final state meet the trigger 
criteria. The lifetime measurement method pre- 
sented here cannot be used on these candidates, 
and they are removed by reconfirming the trigger. 
Wc require that at least one track pair from each 
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TABLE I: Trigger selection criteria for the three two-track trigger selections. We use n/a where no criterion 
is applied, f - The trigger requirements on the were altered during the data taking period. The quantity 
in brackets refers to the first 0.21 fb~l collected. 



Trigger criteria LI 


Units 


Low Px 


Medium px 


High Px 


Minimum track px 


GcV/c 


2.0 


2.0 


2.5 


Two track charge product 




n/a 


-1 


-1 


Two track max A(f) 


degrees 


90° 


135° 


135° 


Minimum two track px scalar sum 


GeV/c 


4.0 


5.5 


6.5 


Trigger criteria L2 


Minimum dg^ 


^m 


120 


120 


120 


Maximum c?g^ 


^m 


1000 


1000 


1000 


Minimum track px 


GeV/c 


2.0 


2.0 


2.5 


Maximum track 




15(25)t 


15(25)t 


15(25)t 


Two track charge product 




n/a 


-1 


-1 


Maximum pair A0 


degrees 


90° 


90° 


90° 


Minimum pair A0 


degrees 


2° 


2° 


2° 


Minimum two track px scalar sum 


GcV/c 


4.0 


5.5 


6.5 


Minimum two track Lxy 


/im 


200 


200 


200 


Trigger criteria L3 


Minimum Idg'^l 


/im 


80 


80 


80 


Maximum dg^ 


/im 


1000 


1000 


1000 


Minimum track px 


GcV/c 


2.0 


2.0 


2.5 


Maximum track rj 




1.2 


1.2 


1.2 


Two track charge product 




n/a 


-1 


-1 


Maximum pair A0 


degrees 


90° 


90° 


90° 


Minimum pair A0 


degrees 


2° 


2° 


2° 


Maximum pair Azg 


cm 


5.0 


5.0 


5.0 


Minimum two track px scalar sum 


GcV/c 


4.0 


5.5 


6.5 


Minimum two track L^y 


/im 


200 


200 


200 



candidate decay pass the L2 and L3 trigger se- 
lection requirements. The particular L2 and L3 
selection that the decay must pass depends on 
which trigger selection accepted the event during 
data taking. In the case where more than one 
trigger selection was satisfied during data taking, 
we require that the candidate satisfies the least 
stringent selection. Reconfirmation of the trig- 
ger requires that the offline reconstructed tracks 
are associated to L2 and L3 tracks in the event. 
To match an offline track to a L2 or^L3 track 

we calculate the = + (^) between 

an offline track and each L2 or L3 track in the 
candidate, where AC and At/) arc the differences 
between the offline and L2 or L3 track C (cur- 
vature) and 0, respectively, and ac are 
the mean uncertainties on the offline track C and 
(j>, respectively. The L2 or L3 track that has the 
lowest is associated with the corresponding 
offline track. If the of the L2(L3) track with 
the lowest x^ is greater than 95(25) we consider 



the match unsuccessful, and deem that the offline 
track has no L2(L3) matched track. 

Collectively, the trigger selection requirements 
and the cuts made on offline or derived variables 
are referred to as the selection criteria. The kine- 
matics of each track are used to calculate the ef- 
ficiency function central to this method. We use 
the following nomenclature to refer to each indi- 
vidual track. The pion originating from the 
vertex is referred to as tt^ and the pion and kaon 
originating from the D vertex are referred to as 
ttd and Kd, respectively. 



B. Sample composition and signal yield 

The invariant D'^tt^ mass distribution after 
the selection criteria have been applied is shown 
in Fig.[TJ The low mass background sideband and 
a small part of the signal peak have been removed 
by the requirement that nis > 5.23 GeV/c^. 
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This cut has been apphcd to remove partially 
reconstructed B~ — > D*'^tt^ / p~ and 
£)(*)-7]-+/p+ decays, where only three tracks of 
the final state are used in reconstruction lead- 
ing to a low reconstructed B mass. If left in the 
sample, these partially reconstructed B mesons 
would bias the proper decay time distribution, 
since they resemble signal candidates, but, due 
to the missing momentum, their proper decay 
time has been mis- measured (see Eq. ([T])). De- 
tailed Monte Carlo studies have shown that the 
applied mass cut leaves the signal peak with a 
negligible contamination ( 0.15%) from partially 
reconstructed B^ D*'^Tr^ decays. No other 
partially reconstructed B hadron decays are ex- 
pected to populate this mass range. The Cabibbo 
suppressed decay B^ D'-*K~ is also present in 
this sample, where the kaon from the B is recon- 
structed as a pion. The lower mass cut does not 
remove all of these candidates, but a tighter cut 
would remove too many B~ — >■ D^tt~ candidates. 
For simplicity, the B~ — >■ D^K~ candidates are 
not fit separately and are treated as B~ -> Z?°7r~ 
candidates for the lifetime determination. This 
simplification is motivated by the small size of 
the contamination (3%), and the small differ- 
ence in reconstructed proper decay time between 
the K and the tt mass assignment of the kaon 
track which is of order 1%. The resulting sys- 
tematic uncertainty was evaluated and found to 
be negligible (Sec.lIXp. The mass distribution of 
the remaining signal candidates, including both 
B- -> D^TT- and B- D°K- is modeled by 
the sum of two Gaussians each with an indepen- 
dent mean and width. The background candi- 
dates are due to track combinations that mimic 
the signature of signal decays. The mass distri- 
bution of background candidates is modeled by a 
linear function. An alternative description which 
allows for a second order polynomial to model 
the background was found to be degenerate with 
the linear function. 



To determine the signal yield the mass distri- 
bution is fit by maximizing an unbinned log likeli- 
hood, £, which is calculated using the mass, m,, 
for each candidate. The letters s and b denote 
whether the PDF describes signal or background 
candidates. The likelihood is given by 



TV 



\og{£)^\ogll[[fsV{m,\s) 



(2) 



where is the signal fraction and V {mi\s) is 



+ il-fs)Vim,\b) , 



given by 
V{m,\s) = 



fi 



(1 - /i) 
j^^t 

(72 V StT 



(3) 



•A 



where the factor A is required to satisfy the nor- 
malization condition 



"ihigh 



V{mi\s)dmi = 1. 



(4) 



V^milb) is described by a first order polynomial 
and is given by: 



V{m,\b) 



1 — anii 



2 (™high 



'low 



(5) 



where toiow and mhigh are the lower and upper 
mass limits, 5.23 and 5.5 GeV/c^, respectively. 

The free parameters in the mass fit are mi , TO2 , 
CTi; (72, Q^! fi, and fs- The data are fit and the 
mass fit projection is shown in Fig. [T] From the 
results of the mass fit a yield of 23900±200 signal 
candidates is determined. We define the upper 
sideband to be the candidates with 5.38 < ms < 
5.5 GeV/c^. These candidates are retained to 
constrain the parameters of the background com- 
ponent of the lifetime fit. The best fit parameters 
are given in Appendix iDl 

The results of the mass fit are also used to 
extract the signal distribution of various param- 
eters using background subtraction. We use this 
technique in several places for cross checks, but 
not as a method to extract the lifetime or any 
other fit parameter. For the purpose of back- 
ground subtraction, we define a signal window 
by 5.25 < TOB < 5.31 GeV/c^ The results of the 
mass fit are used to calculate the fraction of back- 
ground candidates in the signal region. For any 
given parameter, we subtract an appropriately 
scaled high mass sideband distribution from the 
distribution found in the signal region to obtain 
the signal distribution in data. 



V. REMOVING THE 
SELECTION-INDUCED BIAS FOR 
SIGNAL EVENTS 

A. Introduction 

In this section wc derive the PDF that takes 
into account lifetime bias due to the trigger and 
other selection criteria without input from simu- 
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FIG. 1: The top plot shows the mass fit projection (line) on the data (points). The bottom plot shows the 
residual divided by the error for each bin: {Nat — A'data) /V Ndata- 



lation. Only the case of pure signal is considered 
in this section, whereas the complications intro- 
duced by the presence of background candidates 
are discussed in Sec. IVIIl 

Before describing the PDF in detail, we give 
a short overview of the essential idea behind our 
method of correcting for the trigger effects in a 
completely data-driven way. We start by con- 
sidering an unbiased proper decay time distribu- 
tion, which is given by an exponential. To incor- 
porate detector effects, the exponential is con- 
volved with a resolution function. For the pur- 
pose of this measurement, the proper decay time 
resolution function at the CDF detector is ade- 
quately described by a single Gaussian of fixed 
width. For a decay with mean lifetime t and 
Gaussian proper decay time resolution of width 
at, the probability density to observe a signal 
candidate decaying with proper time ti, where 
the subscript i labels the candidate, is given by 



1 



r \(7t T 



where F(x) = 



1 



/27r 



(6) 



e 2 dy, 



and s indicates that this PDF is for signal events 
only. Now consider a dataset subject to the re- 



quirement that the lifetime t is within the interval 
t e [a,b]. In this case, the PDF in Eq. ^ must 
be modified to take into account this selection. 
The effect of the selection can be accounted for 
by correct normalization so that the PDF is now 



ViU;T\s) 



(7) 



'F 



dt 



The same equation can be written as 



E{t)\t 



/ E{ty-e^+^FU-^-f)dt 

— oo 

(8) 

where, for the example given here, the value of 
the efficiency function E{t) is one for a < t < 6 
and zero otherwise. This is essentially the form of 
the lifetime PDF for candidates collected by the 
selection criteria at CDF, except that the func- 
tion E{t) will take a slightly more complicated 
form, and will be different candidate by candi- 
date. We indicate this by adding a subscript i 
that labels the candidate, Ei{t,£s) ■ The intro- 
duction of Es is made because the efficiency func- 
tion will also be shown to depend on Es which 
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is the single track finding efScicncy at level 2. 
This candidate-by-candidate efficiency function 
Ei{t,£s) is the crux of this analysis, and it will 
be described in detail in the following sections. 

The CDF trigger selects on the impact param- 
eters of the tracks in the decay. The impact 
parameter requirements can be translated to an 
upper and lower decay time selection for each 
candidate. These upper and lower lifetime limits 
depend on the kinematics of the decay and there- 
fore differ for each candidate — hence the need 
for a candidatc-by-candidate Ei(t,es)- 

In order to calculate the efficiency function, 
Ei{t,es) for a given candidate we require: the in- 
dividual candidate's decay kinematics, measured 
in the data; the single track finding efficiency 
£s(also extracted from the data); and the trig- 
ger and offline criteria, collectively referred to by 
the symbol T. In terms of these variables, the 
PDF for a candidate with decay time ti is 

V{U;T\T,E,{t,es),s) = 

Edt,es)\t=u^^e^^i^F[^-^) 

/ E,{t,es)x^e^+^F(4^~^)dt 

— OO 

To summarize, we use a different efficiency 
function Ei{t,es) for each candidate i, which en- 
sures the correct normalization of the lifetime 
PDF given the selection. We calculate each 
Ei{t,es) analytically from the candidate's de- 
cay kinematics and the selection criteria, in a 
completely data-driven way, without recourse to 
Monte Carlo. The exact form of Ei{t,es), and 
how it is calculated, is discussed next. 

B. Calculation of ^^^(t, Es) 

1. Scanning through different potential proper 
decay times 

In order to find the function Ei(t,es) for a 
given candidate i, we need to find the trigger effi- 
ciency for that candidate for all possible B proper 
decay times. We scan through different B decay 
times by translating the B decay vertex along the 
B flight direction, defined by the reconstructed 
B momentum. At each point in the scan, we re- 
calculate all decay-time dependent properties of 
the candidate, in particular the impact parame- 
ters and decay distance. Properties that are in- 
dependent of proper decay time (before selection 
is applied), such as the four momenta of all par- 
ticles or the fiight distance of the intermediate D 
meson, remain constant. We re-apply the trig- 
ger and other selection criteria to the translated 



candidate. If the translated candidate fails the 
selection criteria, Ei{t,£s) is zero for that candi- 
date at the corresponding decay time. Otherwise 
Ei{t, Es) is non-zero at time t and its exact value 
depends on the SVT (L2) track-finding efficiency, 
Es- This method of scanning through different 
potential proper decay times allows for the deter- 
mination of the effective upper and lower decay 
time cuts applied by the selection criteria. This 
process is illustrated and described in detail in 
Sec. IVB4I Prior to this, we discuss two compli- 
cations to the basic idea presented above. The 
SVT has a track finding efficiency smaller than 
that of offline track flnding efficiency. The SVT 
track finding efficiency varies as a function of the 
track impact parameter. The impact of this vari- 
ation and the necessary changes to the basic idea 
are discussed in Sec. IVB 21 A secondary com- 
plication is that at different stages in the event 
reconstruction and selection, different algorithms 
are used to calculate the track parameters - very 
fast algorithms at L2, more detailed ones at L3, 
and finally the full tracking and vertexing in the 
flnal offline reconstruction. The measured values 
of track parameters such as impact parameters 
differ slightly depending on the algorithm used 
for the calculation. Section IVB 31 describes how 
the different measurements of impact parameter 
are accounted for. 



2. The value of Ei{t,Ss) and its dependence on the 
SVT track finding efficiency 

a. The need to include the dependence on 
Es If the track-flnding cfflciency is independent 
of proper decay time, one can base a flt on a 
PDF given that a certain track combination has 
been reconstructed and seen by the trigger. This 
would imply that the track finding efficiency is 
constant as a function of the impact parameter 
since the decay time and the impact parame- 
ter are correlated. In the case where the track- 
finding efficiency is proper decay time indepen- 
dent, the set of tracks seen by the trigger would 
be treated exactly in the same way as the de- 
cay kinematics, i.e. as something that can be 
kept constant as the decay distance is changed 
for the efficiency function evaluation. Given that 
a certain track combination has been found, the 
trigger efficiency at a certain decay time is either 
1 (passes selection) or (fails), independent of 
e^. This PDF would ignore one factor: the prob- 
ability that exactly this track combination has 
been found. If this factor is proper decay time 
independent it does not affect the maximum of 
the likelihood and hence the result of the fit. 

The level 3 tracking algorithms arc very similar 
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to those used offline and the level 3 track finding 
efficiency as a function of offline impact param- 
eter is constant. Therefore the track finding ef- 
ficiency at level 3 is decay time-independent and 
the situation is that described above; the level 3 
trigger efficiency is a time independent constant 
for all decay times that pass the selection crite- 
ria. Therefore it is not necessary to consider the 
effect of the level 3 track finding efficiency fur- 
ther. However, the situation at level 2 is more 
complicated. 

Figure shows the SVT track finding effi- 
ciency for tracks found in the offiine reconstruc- 
tion, in data, as a function of the track's offiine 
impact parameter \dQ^\. Figure [2] shows that the 
SVT track finding efficiency of the CDF II detec- 
tor depends on the track impact parameter, and 
therefore on the decay time of the parent parti- 
cle. The SVT track finding efficiency is approx- 
imately constant for < \dQ^\ < 1000 /J,m and 
faUs rapidly for \df\ > 1000 fj,m. The efficiency 
distribution is obtained from the signal region of 
the data sample used in the fit, using the follow- 
ing method: the efficiency prior to triggering is 
obtained by considering the sub-sample of can- 
didates where two particular tracks can pass the 
trigger requirements. For these candidates, the 
remaining third track is used to obtain the SVT 
track finding efficiency. 



by the SVT can be used to calculate the trig- 
ger efficiency, and we can assume that the same 
tracks would be found as the decay vertex is 
scanned along the direction of the B momentum. 
However, since Eg does vary with |do^|, the prob- 
ability of the SVT finding tracks is dependent 
on the decay position. The more track combi- 
nations there arc that pass the selection criteria, 
the higher the probability that at least one is 
found by the SVT. Under these circumstances, 
the proper decay time dependence of the SVT 
trigger efficiency has to be taken into account, 
which requires some parametrization of the sin- 
gle track finding efficiency as a function of d^^ . 

b. Parametrizing es{\dQ^\) While the inclu- 
sion of the single track finding efficiency in 
the PDF increases the complexity of the mea- 
surement, we can take, as a good approxima- 
tion, the following simple model. We model 
the SVT track finding efficiency as constant for 
< \d^^\ < 1000 ^m. We treat all tracks with 
14^1 > 1000 ^m as not-found by the SVT (which 
docs not affect the trigger decision as it requires 
120 fim < I^Q^I < 1000 ^m), so that we can de- 
scribe the SVT efficiency by the following simple 
description: 



Es ff \df\ < 1mm 
otherwise 



(10) 
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FIG. 2: The L2 single track finding efficiency, relative 
to the offline efflciency, as a function of |do'''|. The 
points represent the data. The vertical dashed lines 
represent the trigger selection requirements. The fit- 
ted curves represent possible descriptions of the effi- 
ciency which are described in the discussion of sys- 
tematic uncertainties in Sec. IIX Al 

Even though Eg is approximately constant 
within the trigger acceptance requirements, the 
rapid drop after \df\ > 1000 /im, introduces a 
particular problem. The trigger efficiency is cal- 
culated depending on which tracks are found by 
the SVT. If Eg is constant for all impact param- 
eters, then the tracks which were actually found 



The value of Eg is determined simultaneously 
with the lifetime and other parameters in the fit 
to data and not from Fig. [51 The consequence on 
the lifetime measurement of the small deviations 
of the real SVT efficiency from this simple model 
are discussed in Sec. IIX Al We also assume that 
there is no variation in track finding efficiency as 
a function of track pT or 77. Such variations can 
alter the probability of finding a particular track 
combination. However as these are time inde- 
pendent, the effect on the lifetime measurement 
is expected to be small. This is also discussed in 
Sec. IIX Al where we show that the effects of these 
simplifications on the lifetime measurements are 
indeed, sufficiently small. There is an alterna- 
tive, simpler approach, that docs not depend on 
Eg , which is suitable in situations where the track- 
finding efficiency is constant over a larger range 
than for the SVT at the CDF II detector. This 
is discussed in Appendix [Ul 

c. Calculating Ei{t,Eg) The value of 
Ei{t, Eg) for a given decay time is the probability 
that at least one of the possible track combina- 
tions that pass the trigger criteria is in fact found 
by the L2 tracking algorithms. For example, 
if there is only one track pair in the candidate 
that can pass the selection requirements, then 
the probability of finding both those tracks is 
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e^, where we simply take the product of two 
single track finding efficiencies. For a three body 
final state, where there are two possible track 
pairs that pass the trigger, the probability is 
given by 2e'^ — £5- cases where there are three 
possible track pairs (only possible for the low-py 
selection that makes no requirement on track 
charge), the probability to find sufficient tracks 
to pass the trigger is Se^ — 2el. 



3. Translating Online and Offline quantities 

To calculate the trigger efficiency for all pos- 
sible B proper decay times we scan through dif- 
ferent B decay points along the B flight path 
and determine the probability that the trigger 
was passed at that point. As we re- apply the 
trigger selection, we always base the decision on 
the quantities accessible to the relevant trigger 
level i.e., L2 criteria to SVT tracks, L3 criteria to 
L3 tracks, and offline criteria to the fully recon- 
structed offline tracks. Certain quantities such 
as the track momentum or the opening angle be- 
tween two tracks are decay time independent and 
will remain constant as the vertex is translated 
along the B flight path. Other quantities such 
as the impact parameter will change. Therefore, 
as we translate the B decay along its flight di- 
rection, we need to re-calculate the decay time 
dependent quantities for each level: L2, L3, and 
offline. 

It is trivial to calculate the offline impact pa- 
rameters and reconstructed proper decay time as 
the candidate is translated along its flight path. 
Furthermore, as Ei{t,es) is a function of the 
offline-reconstructed proper decay time, rather 
than the true decay time, it is not necessary to 
reconsider the effects of detector resolution. This 
means that there is a simple, one-to-one relation- 
ship between the offline-reconstructed decay time 
of the translated candidates and the other time- 
dependent offline quantities such as impact pa- 
rameters and Lxy, without the need to take into 
account further resolution effects. We aim to 
retain a similarly simple direct relationship be- 
tween proper decay time and trigger cuts for the 
online quantities as well. Since all L2 and L3 de- 
cay time dependent quantities (dg, L^y) are cal- 
culated from the impact parameters of the tracks, 
the value of the online do is the only parameter 
we need to consider. 

As we translate the candidate along the B 
flight path, we re-calculate each track's online 
do at L2 and L3 (dp^ and dg^), by assuming 
that the differences between online and offline 
quantities are not decay time dependent. This 
way, we can treat this difference in exactly the 



same way as the other proper decay time inde- 
pendent quantities in the candidate, such as track 
Pt ■ We measure the differences in each candidate 
and keep them constant as we translate the can- 
didate along the B flight path. The difference 
between the L2 and offline impact parameter, 
(Ad, 



0)L2 



jL2 



joff 
"0 I 



could vary as a function 
of impact parameter due to the finite hit recog- 
nition patterns used to measure the L2 impact 
parameter. We verify in data that {Ado)L2 is 
time independent. To check this, we calculate 
(Ado)L2 and bin it according to track |dQ^|. In 
each bin, the (A(io)L2 distribution is fitted with a 
Gaussian, and the mean and width of the fitted 
Gaussian for different impact parameter ranges 
is shown in Fig. [31 There are some deviations 
from a straight line, but there is no systematic 
dependence on impact parameter, and hence on 
impact parameter resolution as a function of de- 
cay time. Variations in the impact parameter 
resolution, such as those observed in data, could 
lead to a bias on a lifetime measurement. This is 
addressed in Sec. llXl and we find any systematic 
uncertainty on the lifetime due to this variation 
to be very small (0.02 ps). 
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FIG. 3: The difference, (Ado)i2 is binned as a func- 
tion of |do^| and fitted to a Gaussian. The mean of 
the fitted Gaussian is shown in (a) while the width is 
given in (b). The variation is of order a few microns. 
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The (Ado)L2 for a given track is measured at 
the actual point of decay by accessing the in- 
formation of the L2 track that was matched to 
the offline track. This is then used to calcu- 
late the translated L2 impact parameter d^'^it) 
from the translated offline (io(i) at each point: 
d()'^{t) = (t) + {Ado) L2 ■ A complication arises 
for those tracks not found by the SVT (such as 
those with |c?o| 3> 1 mm). In this case, a value 
of {Ado)L2 is assigned by drawing a value at 
random from the distribution of (A(io)L2 from 
tracks where it is possible to calculate {Ado)L2- 
One further issue to consider is that the L2 algo- 
rithm measures impact parameters to the closest 
10 iim. To emulate this feature of the L2 tracking 
algorithm the calculated <iQ^(t) is rounded to the 
closest multiple of 10 y^tm. The same procedure is 
applied to estimate dg^except that no discretiza- 
tion is necessary. The online L^y values at L2 
and L3 for each track pair are then re-calculated 
from the translated L2 and L3 impact parameters 
of each track. 



4- Example 

To illustrate the entire process, we describe in 
detail a specific example shown in Fig. 2] which 
depicts the same decay at four different decay 
times. For the purposes of this illustration we 
assume this decay has been accepted by the 
medium-pT trigger selection. First, we consider 
the decay vertex translated to point oi as shown 
The decay vertex is close to the 



in Fig. 4(a) 



primary interaction point and only one track has 
\dQ^\ > 120 ^m, therefore the selection require- 
ments are not met. The value of Ei(t,es) at the 
proper decay time corresponding to oi is Hi = 0, 
where _ff is a polynomial function of e^that gives 
the value of the efficiency function at a given de- 
cay position. 

In Fig. |4(b)| the decay vertex has been trans- 
lated further along the B momentum direction 



and is at the point where one track pair satisfies 
the trigger selection and the B decay satisfies all 
other selection requirements listed in Sec. IIV Al 
At this point, 02, the value of Ei{t,es) is given 
by probability of finding both the ttb and the ttd 
track, which is H2 = e^. 

As this candidate is further translated along 
its B momentum direction it moves into the re- 
gion where all three tracks can participate in the 
trigger decision. In Fig. 4(c) two track combi- 
nations fulfill the trigger requirements, (ttb, t^d) 
and (ttd, Kq). The remaining combination, (tt^, 
Ko)^ docs not pass the trigger in this case as it 
does not satisfy the opposite charge requirement 
of the medium-px trigger. The value of Ei{t,es) 
at the decay point 03 is the probability that at 
least one of the two possible track combinations 
is found by the SVT, iJg = 2el - el 

In Fig. |4(d)| the decay vertex has been trans- 
lated to the point 04 where the track impact pa- 
rameter requirements are not satisfied. The value 
of Ei{t, Es) returns to zero at the point where the 
trigger requirements are not met. Hence Ei{t, Eg) 
can be described by a series of intervals limited 
by imin and imax and within an interval the value 
of Eiit^Es) is given by a polynomial in terms of 
Es, H{es)- The efficiency function can be written 
in terms of the Heaviside step function as 



intervals 
in event i 



(11) 



C. The signal PDF and its parameters 

Substitution of Ei{t,Es) as given in Eq. ([TT|) 
into Eq. © leads to the following PDF for ob- 
serving a decay at time i^: 



ViU;T\T,E,it,Es),s) 



E,{t,Es)\t=u^e- 



- p I 1l — 21. 



E Hfc.(e, 



fei=aU 
intervals 
in event i 



p{± 



(12) 



r 



We describe the decay time resolution of the de- 
tector as a Gaussian with width at = 0.087 ps. 
This is the average of the calculated candidate- 



by-candidate CTt- of the background subtracted 
signal region in data. Using a single Gaus- 
sian based on a single, global Ut, instead of a 
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(a)If the decay occurs at point ai, only one track has (b)If the decay occurs at a2 there are two traclcs wliicli 
an impact parameter within the trigger range (shaded would pass all trigger and other selection requirements, 
region). Ei^t^Cs) at the corresponding t is zero. Ei{t^Ss) at the corresponding proper time is the 

probability that both these tracks are found by the L2 
algorithms. 
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(c)If the decay occurs at 03 there are two tracks pairs 
which would pass all trigger requirements. Ei(t,es) 

rises at the point t^nin 2 as the probability to find at 
least one of the two available track pairs is greater 
than the probability to find a particular track pair. 
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(d)If the decay occurs at 04 all the track impact 
parameters are above the trigger threshold. Ei{t^6s) 
at tmaa; 3 retums to zero. 



FIG. 4: The decay vertex is translated along the direction of the B momentum while the decay kinematics 
are held fixed. At each decay point it is determined whether or not the selection criteria could be satisfied 
and Ei{t,es) is calculated. Diagrams not to scale. 



candidate-by-candidatc value, significantly sim- 
plifies the analysis and is justified since the PDF 
is not very sensitive to the exact value of at- 
This is the case for two reasons: the lifetime 
to be measured, 0(1.6ps), is much larger than 
at = 0.087 ps; and the selection requirements re- 
move the majority of candidates with low decay 
times. 

In terms of the PDF in Eq. this implies 

that all terms containing at only have a small 
effect on the PDF because t/r ^ ^a^/r^ and 

- T") ~ -^(^) ~ These approxima- 
tions are not made in the PDF, but they illustrate 
why the dependence on at is small. In Sec. IIXI 



we confirm that the systematic uncertainty due 
to the resolution parametrization is small. 

To use this PDF to extract the lifetime, knowl- 
edge of Es is also required. Although Eq. p2|) 
could be used to simultaneously fit r and Eg, 
there is extra information available in the data 
that can be used to help determine with 
greater precision. The extra information used 
is simply the knowledge of exactly which tracks 
do, and do not, have L2 information. To add 
this information to the PDF, we introduce a can- 
didate observable called track configuration, Ci. 
This observable is defined both by n, the number 
of tracks that arc within the reach of the SVT 
{PT > 2.0GeV/c, \df\ G [0,1] mm), and by r, 
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the number of those that have L2 information. 
The configuration also distinguishes which spe- 
cific tracks have L2 information. The probabiUty 
of observing a particular C;, i.e., that of 77. tracks 
within the reach of the SVT, a specific set of r 
tracks have matches, while the remaining n — r 
tracks do not, is given by 

V{a\T,E.,{t,es),U,s)^ ; , (13) 



where the factor Ei{t, es)\t=ti provides the cor- 
rect normalisation as it is the sum of all possible 
configurations that could have passed the trigger. 



We multiply the probabilities defined in 
Eqs. (ini) and ^ to obtain the PDF which is 
used to simultaneously fit the proper decay time 
and £s- It is given by 



ViU; t|T, E,{t, £,), syr{C,\T, E,{t, Es), U, s) 



intervals 
in event i 



(14) 



In the case of a two body decay, we would always 
find, in both the numerator and denominator of 
the expression, that Yikiiss) = £3(1 ~ Es)""*^ = 
all factors containing would cancel and we 
would recover the expression for two-body decays 
derived in Ref. [25| . If there is no upper impact 
parameter cut or equivalent (tmax = 00), and the 
lower cut is hard enough so that for each candi- 
date ^min > o-*, Eq. (HH) reduces to ie-(*-*">i")/^, 
equivalent to a re-definition of < = 0, as used by 
DELPHI in Ref. Other special cases lead- 

ing to some simplifications are discussed in Ap- 
pcndixO However, none of these apply here and 
we use the full expression given in Eq. ()14|) . 



VI. VALIDATION OF THE METHOD 

We test the signal PDF derived in Sec. |Vl and 
the full PDF with both signal and background 
component that will be derived in Sec. IVIIl on 
simulated events. We use two kinds of simula- 
tions: a full GEANT 3-based [l3| detector simu- 
lation and a fast parametric simulation for high 
statistics studies. 



A. The Full Detector Simulation 

We use the full CDF II detector simulation 
to test whether the signal PDF constructed in 
Sec. |V] can correctly remove the selection bias. 
The simulated data samples used for this test 
consist of single B hadrons generated with pT 
spectra consistent with NLO QCD [13, [13 and 



decayed with EvtGen [1^. A detailed GEANT 3- 
based detector and trigger simulation is used 
to produce the detector response, which is pro- 
cessed using the same reconstruction algorithms 
as data. In addition to a D'^n^ sample, 

we also use samples of three other decay modes; 
5° D+n- {D+ K-Tr+TT+) , B^ ^ (jycj) and 
Bs K^K^, where the offline selection cri- 
teria applied are broadly similar to that of the 
B^ D^-K" candidates. These distinct sam- 
ples, with differing topologies, allow for further 
crosschecks of the basis of the method to correct 
the selection biases. The calculation of the effi- 
ciency function is easily extended to include four 
track decays using the same principle of scan- 
ning through all possible proper decay times as 
described in Sec. |Vl 

As these samples contain only signal events, we 
use the PDF described in Eq. ([H)) to simultane- 
ously extract the lifetime and the L2 single track 
finding efficiency. The fitted lifetimes, along with 
the input truth lifetimes and size of each sample 
are given in Table HIl The fitted lifetime is consis- 
tent with the input lifetime for each Monte Carlo 
sample. These results indicate that the method 
of calculating the event efficiency can be used to 
correct the selection biases. 



B. The Fast Simulation 

In addition to the full CDF II detector simu- 
lation we use a custom fast simulation which is 
several orders of magnitude faster than the de- 
tailed simulation. It allows production of many 
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TABLE II: The fit results on full detector simulated B decay samples. The table also gives the true input 
lifetime and the size of the sample after selection cuts had been applied. 



Decay Sample size Input lifetime Measured lifetime 

B- D°n- 75000 496 ^m 493.3 ± 3.2 /xm 

D+TT- 71000 464 /xm 467.8 ± 2.8 ^m 

Bs H 35000 438 /xm 443 ±5 ^m 

Bs K+K- 75000 438 /xm 441. 5± 2.9 ^l\TL 



thousands of independent samples, each approx- 
imately the size of the data yield (24, 000 signal 
events), that are used for the extensive validation 
and studies of systematic uncertainty. The fast 
simulation is used for validating the technique 
with simulated signal and background events, 
and for evaluating systematic uncertainties. Nei- 
ther the fast simulation nor the full simulation 
described earlier is used to determine or constrain 
any of the parameters that enter the likelihood 
fit to data from which we extract the B~ life- 
time. Below, we describe the fast simulation with 
its default settings. These form the basis of the 
validation studies presented later. How the de- 
fault behavior is altered to estimate systematic 
uncertainties is discussed in Sec. IIXI 

In order to reproduce the data as well as possi- 
ble with a relatively simple simulation, we gener- 
ate many of the kinematic variables in each event 
based on distributions observed in data, in par- 
ticular when generating background. The most 
important ones are summarized in Table Hill For 
every event i we generate the B~ proper decay 
time, ti, the reconstructed mass, m,;, the mea- 
sured momentum. Pi, and the meson proper 
decay time. The B~ mass is generated from 
the PDF described in Eq. ^ using the best fit 
parameters from the mass fit to the data sam- 
ple. For signal events, the B~ and proper 
decay times are generated as exponentials using 
the 2008 world average values of the lifetimes, 
which are 1.637 ps and 0.41 ps for the B~ and 
mesons, respectively [IJ]. The generated proper 
decay times are smeared by a Gaussian of width 
0.087 ps to simulate the detector resolution. The 
generation of the reconstructed B~ proper decay 
time in background events is based on the PDF 
described in Sec. IVII Al Its parameters are de- 
termined from data, by fitting the lifetime distri- 
bution of the events in the upper mass sideband. 
The background proper decay time is taken 
from the decay time distribution observed in 
the upper mass sideband. The direction of the 
B~ momentum is generated uniformly in </> and 
77. As transverse quantities are used to deter- 



mine the measured proper decay time in data, it 
is important to match the pt distribution in the 
simulation, to that observed in data. The magni- 
tude of the B momentum is generated such that, 
after the selection criteria are applied, the dis- 
tribution of Pt of the remaining simulated signal 
events matches the pr distribution observed in 
the background subtracted signal region. Simi- 
larly, we generate the magnitude of the momen- 
tum for background events so that after selection 
there is agreement between the px of simulated 
events and the upper sideband in data. 

We calculate the remaining kinematic variables 
as follows. In the rest frame of the B~ particle, 
the magnitudes of the reconstructed and ttb 
momenta are defined by the generated mass of 
the B~ meson and the world average values for 
the and tt masses [l^l ■ The reconstructed 1)° 
mass is kept fixed because, in data, the mass- 
constrained vertex forces the reconstructed 
mass to the world average value. We pick a di- 
rection for the -Kb momentum isotropic in the B~ 
rest frame; the momentum is in the opposite 
direction. These momenta are then transformed 
into the laboratory frame to calculate the simu- 
lated and ttb momenta. The equivalent pro- 
cedure is carried out to calculate the ttd and Kd 
momenta in the laboratory frame. The B~ and 
13° decay vertex positions are calculated from 
the generated proper decay time and momentum; 
knowledge of these allows for track impact pa- 
rameter calculation. These impact parameters 
are defined to be the offline impact parameters. 

We simulate the SVT with a single track find- 
ing efficiency of el'^ =65% for signal events and 
e^'^s = 55% for background events. The effi- 
ciency is different for signal and background be- 
cause, in general, we find in our data that back- 
ground tracks have fewer hits in the silicon layers, 
and hence a lower track finding efficiency. The 
values for the track finding efficiency we use for 
the simulation are approximately those found in 
data for tracks with \d'f\ < 1000 ^m, obtained 
from the simultaneous proper decay time, mass, 
and efficiency fit (the fit results for all param- 



17 



TABLE III: Kinematic parameters of the fast simulation and the parent distribution used for generation. 
Details are given in the text. 



Randomly generated parameter 



Parent distribution 



t reconstructed decay time of B (signal) 



t o reconstructed decay time of 
\P\ magnitude of momentum 
•niB mass 



(signal) 
(signal) 



^ 1 



background-subtracted data 
(signal) PDF given in Eq. ([3]), fitted to data 



t reconstructed decay time of B^ (bkg) 

tY) reconstructed decay time of Z?" (bkg) 

|P| magnitude of momentum (bkg) 

niB B~ mass (bkg) 



PDF given in Eq. (|T8l), fitted to data 

sideband data 

sideband data 
PDF given in Eq. dS]), fitted to data 



azimuth angle of B momentum 
pseudorapidity of B^ 



Ad, 



L2 jL2 



Joff 



uniform 
uniform with \'q\ < 1.5 
Gaussian, then round 
d^"^ to nearest 10 jiva. 



eters can be found in Appendix |Dj Simulation 
tracks with \d'^\ > 1000 /im are not used in the 
trigger decision, and are treated in the fit as not 
found by the SVT, so there is no need to model 
the behavior of the SVT efficiency for tracks with 
> 1000 fim. For those tracks that are found, 
the SVT-measured impact parameter, is ob- 
tained by adding a Gaussian-distributed random 
number to the dg^. The Gaussian is centered at 
and has a width of 35 /im, which is consistent 
with the width observed in data (Fig. |3(bJ| ). The 
result is then rounded to the nearest 10 /im, as 
in the real SVT. The difference between the L3 
impact parameter and is not simulated. 
Although the mean dg^ in data is shifted from 
zero, further tests, detailed below, confirm that 
the central value of the dg^ distribution does not 
affect the results. Therefore, these differences be- 
tween the fast simulation and the data will have 
a negligible effect on the interpretation of the re- 
sults. 

After all kinematic quantities have been ob- 
tained in the way described above, the selection 
criteria are applied to replicate the biases ob- 
served in data. All decay products are required 
to lie in the fiducial volume of the CDF II detec- 
tor. The three two-track trigger configurations 
summarized in Table U represent three different 
sets of selection criteria. Events are generated 
with each set of cuts separately and then com- 
bined in the fractions observed in the data. In 
data we observe very few events with tracks that 
have \r]\ > 1.5. Therefore events that have simu- 
lated tracks with I77I > 1.5 are removed from the 



sample. For background, prior to applying se- 
lection cuts, we further reject events so that the 
Pt spectrum of the candidate ttb after the cuts 
are applied matches that observed in the data 
upper sideband. This further rejection for back- 
ground events effectively changes all kinematic 
distributions observed after the selection crite- 
ria are applied and forces the simulated back- 
ground to have the characteristics of background 
observed in data. Overall there is broad agree- 
ment between the distributions of impact param- 
eters, momenta, Acj) of track pairs, and r/ in the 
simulated and real data. As impact parameters 
are particularly important in this analysis, we 
compare the tt^i impact parameter distribution 
from the fast simulation and in real data in Fig.[5l 
Given the simple nature of the fast simulation, 
the agreement with data is remarkably good, al- 
though of course not perfect. Since the simula- 
tion is not used to determine any parameters in 
the final fit to data, but only to test the robust- 
ness of the method and to estimate systematic 
uncertainties, we do not rely on a perfect match 
between the simulation and the data, and the 
agreement we observe is sufficient. 



C. Validation of the method on signal 
events 

We use the custom fast simulation for high- 
statistics tests of the signal PDF given in 
Eq. ([HI). We generate 1000 samples of 24,000 
signal events each, similar to the yield observed 
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FIG. 5: A comparison of the impact parameter dis- 
tribution of the ttd track in data (triangle points) 
and fast simulation (circular points). The compar- 
ison between generated signal events and the back- 
ground subtracted signal region in data is shown in 
(a) while (b) shows the comparison between gener- 
ated background events and the upper sideband in 
data. All distributions are normalized to one event. 



in data. The proper decay time distribution 
for each sample has been sculpted by the same 
decay time dependent selection cuts as in real 
data, applied to the simulated data as described 
in the previous section. We maximize a likeli- 
hood function for signal events, constructed from 
the PDF in Eq. dH]), to extract a best fit life- 
time for each sample. Fitting the resulting pull 
distribution with a Gaussian, we find a mean 
ji = —0.026 ± 0.034 and a standard deviation 
a = 1.027 ± 0.024. This demonstrates that 
Ei{t, £s) is correctly calculated, and that the like- 
lihood formed from the PDF in Eq. can cor- 
rect for the selection biases. It also shows that 
assigning the value of Ado to tracks that did not 
have an SVT match, from the distribution of Ado 
of tracks that did have an SVT match, does not 
cause any bias. In addition to this single test to 
validate the method itself we performed further 



tests to cross check our assumptions described 
below. 

There are some differences between the value 
of the single track finding efficiency applied in 
the fast simulation and the efficiency observed 
in data. To test that the results were not sensi- 
tive to the default values of the efficiency chosen 
for the fast simulation, we varied the input ef- 
ficiencies around the default values and saw no 
bias due to the value of input efficiency or due to 
the difference between signal and background ef- 
ficiencies. The fitted efficiency was always consis- 
tent with the input value. In Sec. IIII C] we noted 
that there have been three changes to Eg over the 
course of the period of data taking used for this 
analysis. To determine whether it is sufficient 
to parametrize the SVT track finding efficiency 
with a single value (representing the average Ss 
over these three data taking periods), we gener- 
ated samples containing events simulated using 
three different values of Eg in the proportions ob- 
served in data. These samples were fit using only 
one average parameter, which was allowed to 
float in the fit. The resulting pull distribution 
has a mean consistent with /xm, however the 
width is 1.19±0.03. This can be understood as 
follows: Each Ei(t, Eg) is a measure of the statisti- 
cal power of each event [25| . By using an average 
Eg, the statistical power of each event has been 
incorrectly assumed in the fit leading to an in- 
correct estimate of the statistical uncertainty. If 
instead, we allow for three floating efficiency pa- 
rameters where each parameter is only sensitive 
to the events in one of the data taking periods, 
the resulting pull distribution once again has unit 
width. Therefore, in the fit to data, we use three 
parameters to describe £s, each floating in the fit, 
one for each data taking period. 

In the default simulation the (Ado)L2 distri- 
bution is generated with a Gaussian distribution 
with mean fi = fim and width cr = 35 /im. 
As the fit method takes all its information about 
(A(io)L2 from data, and makes no assumptions 
about the shape of the (Ado)L2 distribution, we 
expect it to perform equally well for any {Ado)L2 
distribution, including asymmetric and biased 
distributions. We test this by generating data 
with two alternative models for {Ado)L2- For 
the first model we use a biased impact param- 
eter resolution function described by a Gaussian 
with mean fi = S5 fim and width (t = 35 /im. 
To truly stress-test the sensitivity of the method 
to the {Ado)L2 distribution, the second alterna- 
tive model is a, somewhat unrealistic, biased and 
asymmetric resolution function described by an 
exponential decay distribution with mean 35 /xm, 
so that all dp^ are larger than the dg^ . For both 
models we perform pull studies with the same 
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sample size as observed in data and observe no 
bias in the fitted lifetime. This confirms that the 
fit method is robust with respect to the shape 
and mean of the (Ac?o)l2 resolution function, and 
that the observed shift from zero in data of the 



mean of the distribution in Fig. 3(a) does not 
affect the fit result. 

Other assumptions, including the dependence 
of the SVT efficiency on impact parameter, pt, 
and the effect of small differences in the {Ado)L2 
resolution depending on impact parameter, are 
discussed as sources of systematic uncertainties 
in Sec. llXl 



VII. THE COMBINED PDF FOR 
SIGNAL AND BACKGROUND EVENTS 

In this section we derive the PDF for a sam- 
ple containing signal and background events. We 
remind the reader that wc use four measured 
observables in the fit; the measured proper de- 
cay time, ti, the efficiency function, Ei{t,es), 
the mass, m^, and the track-configuration ob- 
served, Ci. An unbinned maximum likelihood fit 
is used to determine the lifetime of the B me- 
son and other parameters. Candidates in the 
data sample have passed the selection criteria, 
T, which means that we must consider the con- 
ditional probability that a candidate has a par- 
ticular ti , rrii, Ei{t,£s), and C;, given that the 
selection criteria have been satisfied. There are 
only two classes of candidates in the data sample: 
signal and background; therefore, the likelihood 
function is defined as 

C = \{[V {s,U,m,,C,,E,[t,es);T\T) 

i 

+ 7'(6,t„m„a,S,(t,£fs)|T)], 

(15) 

where the first term represents the likelihood for 
signal candidates and the second term is the like- 
lihood for background candidates. For readabil- 
ity, the dependence on other fit parameters, such 
as those related to the parametrization of the 
mass distribution, is suppressed and only the de- 
pendence on the fit parameter r is explicitly writ- 
ten. 

The PDF for signal candidates can be factor- 
ized into the following form, 

V[s,ti,mi,Ci,E.i{t,es)]T\'Y) = 
V{U-T\T,E,{t,es),s) 
xria\T,E,it,es),U,s) (16) 
xP(S,(t,£,)|T) x7'(m,|T,s) 
xP(s|S,(t,e,),T) 



where a detailed derivation of this factoriza- 
tion is given in Appendix |^ There is also 
an entirely analogous factorization for back- 
ground candidates. The combined factor 
riU;T\T, E,{t, Es), s)r{Q\T, E,{t,es), U,s) de- 
scribes the proper decay time distribution and in- 
cludes the track configuration information which 
determines Eg- Note that 'P{Ei{t,es)\T) and sim- 
ilar expressions refer to the probability to find a 
given efficiency function Ei{t, Sg)- It does not re- 
fer to the function as evaluated for a given t or 
Eg, but to the function as a whole. V{Ei{t, £s)|T) 
therefore does not depend on the value of ti or 
Es- The factor V{Ei{t, es)\'T) is independent of 
T and whether a candidate is signal or back- 
ground. Hence, it can be ignored in the like- 
lihood. The factors Vim,\T,s) and Vim,\T,b) 
(from the background part of the PDF) describe 
the mass distribution and are described earlier, 
in Sec. HVB] The final factor V{s\Ei{t,Ss),T) is 
the probability that a candidate is signal given its 
efficiency function. Each factor of the likelihood 
is normalised to one candidate. 



A. The parametrization of the background 
proper decay time PDF 

This section considers the proper decay time 
term in the PDF in Eq. the analogous term 
for the background candidates, and describes the 
parametrizations of the PDFs used for the fit. 
For the signal component a physics model is used, 
for the background contribution it is sufficient to 
provide an empirical description of the data. The 
first two factors on the right hand side of Eq. ((T6)) 
are identical to the left hand side of Eq. 
and this is the PDF used to fit the proper decay 
time and single track finding efficiency for signal 
candidates. Three different values of ef^ are fit, 
one for each time period as described in Sec. IVll 

For background candidates 

V{tf,T\T,E,{t,es),b)Via\T,E,{t,es),t„b) 

^ £fs"(l-£fE)"-''y(t,) 

t ' 

-bkg\ 



E i?.(ef^) / y{t)dt 

*min k 



k—all 
intervals 



(17) 



where, similarly to signal, there are three values 
of e^^^ to fit. The function y{t) can be deter- 
mined empirically from the data. Simple forms 
of y{t), such as a sum of exponentials convoluted 
by a Gaussian, were found to provide an unsat- 
isfactory description of the data. Therefore, the 
function y{t) is empirically determined using an 
interpolation of exponentials given by 
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y{t) = e°'"^( '^ti-*.' for tj <t< tj+i. 

(18) 

We use ten fit points (tj), which are spaced more 
closely at low t where the proper decay time 
distribution of background candidates is concen- 
trated. The values of the corresponding aj are 
determined alongside the other fit parameters 
in the unbinned maximum likelihood fit. This 
parametrization was tested on data from the up- 
per sideband to ensure that it is a good model 
for the data. The tests on the upper sideband 
were only used to distinguish the performance of 
different parametrizations. No fit parameters are 
fixed from this test. 



B. The complication in combining the 
signal PDF and the background PDF when 
using a candidate-by-candidate efficiency 
function 

Combining the signal and the background PDF 
while using a candidate-by-candidate efficiency 
function introduces a significant complication 
into the analysis. The rest of this section de- 
scribes this problem, and its solution, in de- 
tail. As discussed in [l^, when a candidate-by- 
candidate quantity enters a fit with a signal and 
background component, the PDF for this quan- 
tity needs to be included in the fit. In our PDF 
this effect is taken into account by a term that de- 
scribes the candidate-by-candidate signal proba- 
bility dependent on the Ei(t,es)- So, instead of 
an overall signal fraction P(s), there is a signal 
weighting for each candidate which depends on 
the efficiency function. This is described by the 
factor V{s\Ei{t,es),T), and the corresponding 
term for background is simply V{b\Ei{t, Es), T) = 
1 — V{s\Ei{t, Es), T). Alternative ways of factor- 
izing the PDF would lead to different ways to 
take this effect into account, but, regardless of 
the choice of factorization, the underlying need 
to include a PDF for the efficiency function re- 
mains. 

The factor 'P{s\Ei{t,es),T) can be simplified 
to an overall signal fraction, 7^(s), only in the 
case where the efficiency function distributions 
are the same for signal and background. Figure |B] 
shows the mean efficiency function, Ei{t,es) for 
candidates in the upper sideband (background) 
and the background subtracted signal region. 
The mean is determined simply by summing all 
efficiency functions in a sample and dividing by 
the number of candidates. The two Ei{t,es) are 
clearly different which shows that the distribu- 
tion of efficiency functions in signal and back- 
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FIG. 6: The mean E{t,es) function for signal 
and background candidate. Signal (solid line) and 
background (dashed line) candidates have different 

ground must be different. We can estimate the 
bias on the lifetime measurement we would get 
if we were to ignore the differences in the effi- 
ciency function, by simplifying V{s\Ei{t, es),T) 
to V{s). Using the custom fast simulation de- 
scribed in Sec. IVI B| wc find a bias of approxi- 
mately — 0.018±0.001 ps. Any advantage gained 
in precision by using a simulation independent 
method would be negated by a bias of this size. 
Therefore, a successful simulation-independent 
method for correcting a trigger bias must include 
a proper description of the term 'P{s\Ei{t, £s), T). 

C. Calculating the term V{s\Ei{t, £s),T) 

1. Overview 

To correctly represent the PDF in the fit we 
require a parametrization of the signal fraction 
that is dependent on the candidate-by-candidate 
efficiency function, Ei(t,es). However, it is dif- 
ficult to parametrize a distribution of functions, 
and that is what is required to derive a signal 
probability as a function of each individual effi- 
ciency function. The problem is simplified if we 
represent Ei(t,es) by a number, Xi, as it is con- 
siderably easier to parametrize the distribution of 
the scalar variable x rather than a distribution of 
functions, i.e., we aim to find a variable x such 
that we can replace 'P{s\Ei{t, e^), T) by assuming 
'Pis\x,T) w V{s\E,{t, es),T). For this approach 
to succeed, x must be chosen in such a way that 
the loss of information regarding the signal prob- 
ability contained within Ei{t, e^) is minimized as 
we transform from Ei{t,es) to x. Note that the 
transformation of Ei{t,es) to x is only used for 
determining the signal probability of each can- 
didate. The proper decay time probabilities are 
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unchanged and continue to use Ei{t,es), as the 
trigger bias cannot be corrected without the fuU 
description. To summarize, the parametrization 
of the term 'P{s\Ei{t,es),T) involves two steps: 

• Transforming the efficiency function 
Ei(t,es) into a representative number x. 

• Describing the signal fraction as a func- 
tion of X, P(s|x, T) with a suitable function 
whose parameters will be determined in the 
fit. 

These arc discussed below. 



2. Representing the Efficiency function by a Scalar 

In order to translate the efficiency function 
Ei {t, Eg) into a scalar variable, we make use of the 
Fisher Linear Discriminants method j30| . This 
method transforms a vector of variables into a 
single scalar variable. We represent each Ei{t, Eg) 
as a vector that contains all the relevant infor- 
mation about Ei{t,es) and then use the Fisher 
discriminant method to translate this vector into 
a number, the Fisher scalar, x. Note that we 
do not use the Fisher discriminant method to se- 
lect candidates. The scalar resulting from the 
Fisher discriminant method is optimized for dis- 
tinguishing signal from background, and there- 
fore fulfils the requirement of minimizing the loss 
of information about the signal probability as we 
translate Ei{t,es) to Xj, so that P{s\xi,T) sa 
P{s\Ei{t,es),T) to a very good approximation. 
How good this approximation is, is quantified 
below. Here, we summarize the method rather 
briefly. Further details can be found in Ap- 
pendix |B] 



3. Finding the Fisher discriminant in a simulation 
independent way 

The Fisher scalar variable, Xi, is given by 
Xi = w ■ Vi where Vi represents one candidate's 
efficiency function and w is another fixed vector. 
The Fisher Linear Discriminant method provides 
a way to determine a vector such that it max- 
imizes the separation of signal and background 
candidates in the variable x. The transformation 
of the information contained in Ei{t,es) to Vi is 
described in detail in Appendix |B] The transfor- 
mation does not require the values of ef^ or e^^^ 
and hence the transformation can be done before 
the fit determines the values for the efficiencies. 

In typical uses of the Fisher Linear Discrimi- 
nant method, the calculation of w requires not 



only the knowledge of all the Vi, but also knowl- 
edge of Vg and v^, which are the mean Vi for 
signal and background candidates, respectively. 
Traditionally, Vg and are determined from 
independent training samples, such as detailed 
Monte Carlo data. Since this analysis uses no 
input from simulation we use the data itself to 
calculate Vs and Vb. For this measurement, we 
use candidates in the upper sideband to deter- 
mine Vf,. We perform a background subtraction 
on candidates with 5.25 < ras < 5.32 GeV/c^ 
to determine Vg. Further information regarding 
the determination of Vf, and Vg is given in Ap- 
pendix lEl 



4. Testing the assumption that 
P{s\x„T) ^ P{s\E^{t,es),T) 

Before proceeding further, it is important to 
test the assumption that the Fisher scalar vari- 
able Xi is representative of Ei{t,es)- We use a 
custom fast simulation and fit the lifetime of the 
1000 independent samples of signal and back- 
ground candidates, using the Fisher scalar Xi to 
determine a signal probability per candidate. It 
is desirable to quantify how the assumption that 
P{s\E,{t,eg),T) « P{s\x^,T) affects the fit re- 
sult, in a way that is independent of any partic- 
ular parametrization of P{s\xi^T). (The partic- 
ular choice of parametrization is discussed sep- 
arately and is described in Sec. I VII C 51 ) To 
do this we make use of the truth information 
available from the simulated data. As shown by 
the data points in Fig. [71 7'(s|xi, T) can be cal- 
culated by finely dividing the sample into 100 
bins in x, and simply counting the number of 
signal and background candidates in any par- 
ticular bin of the variable x. So, for each Xi, 
we determine V{s\xi,T) by reading its value off 
a histogram generated from the truth informa- 
tion. We find that the mean lifetime shift in 
those 1000 fits is only 0.0013 ps, which is sig- 
nificantly smaller than —0.018 ps found when 
the distribution of efficiency functions is ignored. 
This demonstrates that the variable Xi is a sat- 
isfactory substitute for Ei{t,es) for the purposes 
of calculating the probability that a candidate 
is signal given its efficiency function, and that 
P{s\Ei{t, es),^) ~ P{s\xi,T) is a reasonable as- 
sumption. This mean shift of 0.0013 ps is small 
in comparison to the statistical uncertainty from 
the data sample size and is taken as the sys- 
tematic uncertainty due to assuming the scalar 
variable is entirely equivalent to using the full 
efficiency function. This method of calculating 
P{s\xi,T) is only used in this set of test fits. For 
other tests, (and for the final data fit), no truth 
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information is used, and the parameterization of 
7'(s|a;i,T) described in Sec. IVIIC 51 is used. 



parametrization works as well as the truth in- 
formation any systematic uncertainty due to the 
parametrization of P{s\xi,T) is negligible. 



5. Parametrizing the signal fraction as a function 
of the Fisher scalar variable 

In order to apply this method to real data in 
a simulation-independent way, we need to find a 
function that parametrizes P(s|a;j;, T), and whose 
parameters can then be determined in the fit to 
data. We use Lagrange interpolating polynomi- 
als as they provide a very general parametriza- 
tion that makes minimal assumptions about the 
shape of the distribution to be fitted. This 
parametrization has as its parameters the sig- 
nal fractions pj at certain discrete values of the 
Fisher discriminant Xj, so P{s\xj,T) = Pj- The 
value of P{s\x, T), for general x, is calculated us- 
ing a smooth interpolation between those points. 
The Pj are determined in the fit. 

Our default choice for the Xj is the following: 
We divide the a; axis into N — 15 equal bins. 
As the number of candidates at the edges of the 
distribution is small, we merge the first two bins, 
and also as the last two bins. We place our xj 
at the center of each of the resulting bins. This 
results in 13 fit parameters, pj, representing the 
signal fractions at the 13 Xj. We tested the ro- 
bustness of this choice by trying out different 
numbers of bins iV, and found that there is neg- 
ligible difference in performance for any value of 
N from 10 to 20. 

This parametrization is tested using the fast 
simulation. Figure [7] shows the projection of the 
fitted Lagrange interpolating polynomial, /(x), 
where the truth information has been superim- 
posed for one sample of simulated data. In con- 
trast to the test in Sec. IVII C 41 where we tested 
the assumption P{s\xi,T) « P{s\Ei{t, es),T), 
the fit, here, is performed in the same way as in 
our final fit to real data: at no point is truth in- 
formation or any external simulation input used 
in the fit, and the pj parameters of P{s\xi,T) 
are determined in the fit at the same time as 
all other fit parameters, such as the lifetime or 
Es- The projection of P{s\x,T) obtained in this 
fit matches closely the histogram obtained from 
truth information, giving us confidence that this 
parametrization provides a good description. We 
tested this parametrization using 1000 simulated 
samples and observed a mean residual of 0.0013 
ps. The lifetime pull distribution is described by 
a Gaussian with mean 0.039±0.036 and width 
1.097±0.029. This demonstrates no further shift 
in the mean residual position relative to the small 
shift, resulting from the assumption P{s\xi,T) sa 
P{s\E,{t, es),T), observed in Sec. EUni As the 



6. Summary: The full signal & background PDF 
with the factor V{s\Ei{t,es),T) 

In summary, we find that the PDF in Eq. p^ . 
with the factor P{s\Ei{t,es),T) parametrized as 
described in this section, successfully corrects for 
the selection bias in data samples where both a 
signal and a background component is present. 
The 0.0013 ps residual is taken as a systematic 
uncertainty due to the method of describing the 
term V{s\Ei{t, es),T) by the Xi variable. The 
width of the pull indicates that the method un- 
derestimates the statistical uncertainty by 10±3 
%. To be conservative we increase the statistical 
uncertainty of the fit to data accordingly. 
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FIG. 7: The data points show the signal fraction as 
a function of Fisher scalar for a sample of simulated 
data. The line shows the projection of the Lagrange 
interpolating polynomial determined by the simulta- 
neous fit to proper decay time, signal fraction and 
other parameters. 



VIII. FIT RESULTS 

This section describes the fit to data se- 
lected by applying the selection criteria listed in 
Sec.|lV3 An initial mass fit is performed, as de- 
scribed in Sec. IIVB[ with seven free parameters. 
The best fit results are given in Appendix |D] The 
results of the mass fit are used to perform the 
background subtraction required to calculate the 
Vs, which is needed for the Fisher Discriminant 
Analysis. 

The lifetime is determined in a second fit. The 
likelihood function used in this unbinned max- 
imum likelihood fit is given by Eq. (|15l) and 
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Eq. pj)) and is 



log£ = 



ki— all 
intervals 
in event i 



' /i • e , (1 - /i) • e 
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intervals 



1 — am,; 
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a /™2 



(19) 



where y{t) is defined in Eq. (fTSl) and /(x) is 
described in Sec. I VII C 51 The parameters that 
determine the mass shapes for signal and back- 
ground are fixed at the values determined in the 
initial mass fit. However, the signal fraction is 
not taken from the mass fit because this is now 
redefined in terms on the Fisher scalar variable. 
In total, there are 30 free parameters in the life- 
time fit. These are the following: one for the 
signal lifetime, ten to describe the background 
proper decay time distribution as described in 
Sec. IVII Al 13 parameters to determine the sig- 
nal fraction as a function of the Fisher scalar, 
/(x), defined in Sec. IVII C 51 and six parameters 
to describe the single track finding efficiency as 
described in Sec. IVII Al 

The proper decay time fit projection for all 
events in the fit is shown in Fig. [S] The func- 
tion ,f{x), determined by the fit is shown in 
Fig. [21 and the distribution of the variable x 
itself is shown in Fig. (TU] To assess how well 
,f{x) determines the signal fraction, the data with 

— 7<x <2 are divided into nine bins. A mass 
fit is performed separately for the events in each 
bin to obtain an independent measure of the sig- 
nal fraction in that bin. For x outside the range 

— 7<x<2, there are insufficient data to per- 
form a mass fit. The signal fractions, as deter- 
mined by the series of mass fits, are overlaid on 
the function, f{x), in Fig. |9l and there is good 
agreement between the two determinations of sig- 
nal fraction. 

The fit result for the B~ lifetime is t{B~) = 
1.663 ± 0.023(stat) ps, where the uncertainty has 
already been scaled by the factor 1.1 as discussed 
in Sec. IVII C 61 The fit results for all other pa- 



rameters can be found in Appendix |D] The life- 
time is only weakly correlated to the other fit 
parameters; the correlation coefficient between 
the lifetime and any other fit parameter is al- 
ways less than 10%. The statistical uncertainty 
on t{B~) is about twice as large as one would 
naively expect from dividing the fit result by 
the square-root of the number of signal events, 
frnaive ~ T / \/ ^sg = 0.011 ps, which usually 
gives a reasonable estimate for data with good 
proper decay time resolution and small back- 
ground contamination as we have here. As shown 
in Ref. [IHl, the cause for the increased uncer- 
tainty is the trigger bias, specifically the upper 
impact parameter cut in the trigger, which leads 
to a significantly reduced statistical precision per 
event. The size of the effect is consistent with 
that calculated in Ref. [25| . 



IX. SYSTEMATIC UNCERTAINTIES 

In this section we evaluate the systematic un- 
certainty on our measurement from a variety of 
possible causes. The two dominant uncertainties 
are due to the dependence of the single track find- 
ing efficiency of the SVT on impact parameter 
(Sec. IIX A[) and the correlation between the mea- 
sured ti and mi that we observe in background 
data from the upper sideband. 

We evaluate each uncertainty as follows: for 
each source of uncertainty 1000 samples of sim- 
ulated data are generated using the fast simula- 
tion. Each sample contains approximately the 
same number of signal and background candi- 
dates as are found in data. The samples are gen- 
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FIG. 8: This figure shows the projection of the lifetime fit onto the data. The signal and background com- 
ponents are shown separately (dotted lines) and in addition (solid line). The points are data. The lower plot 
shows the residual divided by the error for each bin. 



erated using a non-standard configuration that 
simulates the effect under consideration; we then 
extract the B~ lifetime from each sample in the 
same way as we do for data, using the standard 
PDF described in Sec. IVIIl For each source of 
systematic uncertainty, the mean residual, (fit- 
ted lifetime — input lifetime), averaged over the 
1000 samples, is taken as the systematic uncer- 
tainty. The statistical uncertainty on the mean 
residual from 1000 generated samples of simu- 
lated data is approximately 0.0007 ps and sys- 
tematic uncertainties of this size or smaller are 
deemed negligible. 



A. The dependence of the single track 
finding efficiency on impact parameter 

The leading source of systematic uncertainty is 
the parametrization of the L2 single track find- 
ing efficiency function of track impact pa- 
rameter. As described in Sec. |Vl we assume that 
Es {\df\) is constant for \df\ < 1000 ^im. Fig- 
ure [5] shows the efficiency as a function of \dQ^\ 
in data, and indicates that Ss starts dropping 



slightly before \df\ = 1000 /im. To obtain a 
model for the track finding efficiency to use in 
the simulation, we fit the SVT single track find- 
ing efficiency as a function of |do*^| found in data, 

using the function s(\df\) = po x G ( '"'"'p'^' ), 

where po, Pi, and p2 are free parameters and G{x) 
is the complementary error function defined as 
G{x) = exp{—t^)dt. This fit results in one 

particular determination of the single track find- 
ing efficiency shape. We create other SVT single- 
track efficiency distributions, consistent with the 
data, by varying pq , pi and p2 by the statistical 
uncertainty of their fitted values. Of these dis- 
tributions we choose the three which we expect 
to produce the largest biases in the fitted life- 
time, i.e., the distributions that have the largest 
difference in efficiency between |dg^| = and 
I fig*'' I = 1000 Aim. These three SVT single track 
efficiency functions, one of which is the original 
fit result itself, are represented by the three lines 
in Fig. [2] The different single track efficiency 
functions are implemented in the simulation by 
assigning SVT matches with the probability de- 
termined by the given function. For each of the 
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FIG. 11: SVT single track finding efficiency as a func- 
tion of tracli transverse momentum. Tlie (line) is a 
third order polynomial fit to the data (points). 



that esid-Q^) is constant for tracks with impact 
parameter less than 1000 /im. 



FIG. 9: Projection of the signal fraction, f{x), as a 
function of the Fisher scalar, determined from the 
fit (line). The data points are the signal fraction 
determined from mass fits using events only lying in 
that particular bin of x. 
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FIG. 10: The distribution of the Fisher Scalar vari- 
able X, in data. 



B. Single track finding efficiency 
dependence on px and r/ 

The fit also neglects the dependence of Eg on 
Pt and r] for tracks that pass the trigger criteria, 
i.e., with Pt > '2 GeV/c. Figure [TT] shows Esipr) 
in data. The line through the data represents a 
fit using a third order polynomial. The efficiency 
is obtained in a similar manner to Figure [21 where 
the third track, in the sub-sample of candidates 
where the other two tracks are sufficient to pass 
the trigger, is used to determine the efficiency. 
The Pt dependence is incorporated into the fast 
simulation by assigning SVT matches based on 
the probability given by the polynomial func- 
tion. We determine a systematic uncertainty of 
0.006 ps. Similarly, we evaluate the effect of the 
dependence of the SVT single track finding effi- 
ciency on the track's pseudorapidity and obtain 
a systematic uncertainty of 0.001 ps. The de- 
pendence of Eg on track pT and 77 are not large 
sources of uncertainty as they are not directly re- 
lated to the proper decay time unlike the impact 
parameter. 



three functions considered, a set of 1000 simu- 
lated samples is generated, and fit, with the stan- 
dard PDF that assumes a flat SVT single track 
efficiency for \df\< 1000 //m. The mean lifetime 
residual from the fits to these samples varies from 
-0.0060 to -0.010 ps, depending on the values of 
POj Pi andp2 used. To be conservative we assign a 
0.010 ps systematic uncertainty due to assuming 



C. Dependence on the impact parameter 
resolution shape 

We assume that the impact parameter reso- 
lution between the offline and online algorithms 
remains constant as a function of impact parame- 
ter. As discussed in Sec. lVI C| it has been shown 
that the technique of sliding the decay vertex is 
insensitive to the actual shape of the resolution 
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as long as the shape remains constant. In the 
data we do, however, see subtle differences, at the 
level of a few microns, in the mean and width of 
the resolution as a function of impact parameter 
(Fig. [3l Sec. |V|. To test this effect we incorpo- 
rate such differences, as found in data, into the 
fast simulation. The bias observed due to this is 
0.002 ps. 



D. Dependence of background observables 
on mass 

In data we observe a correlation between the 
measured ti and rn; for background candidates in 
the upper sideband, which is shown in the scat- 
ter and profile plot in Fig. [121 We assume that 
this correlation is described well by a linear re- 
lationship and determine that the mean recon- 
structed proper decay time of background varies 
by approximately 0.13 ps over a mass range of 
0.27 GcV/c^, which is the mass range used in 
the lifetime fit. The derivation of the PDF as- 
sumes that the proper decay time has no de- 
pendence on the measured mass of the back- 
ground candidate. To test the effect of neglecting 
the correlation between U and in the PDF 
we extrapolate the same linear correlation for 
background candidates underneath the peak as 
observed in the sideband. We generate simu- 
lated data where background candidates are re- 
jected in such a way as to introduce a correla- 
tion between the mass and proper decay time 
of the candidate, similar to that observed in 
data. We determine a systematic uncertainty 
of 0.0083 ps using samples of fast simulation 
signal and background candidates. This is one 
of the leading sources of systematic uncertainty. 
It could be reduced in future measurements 
by defining a proper decay time parametriza- 
tion for background that includes dependence on 
the mass. One possible way to do this would 
be to assume that V{ti\T, Ei{t, es),b,mi) = 
P(4|T, E,{t, Es), b) where t] = U+l3(mi-mo),mo 
is a central mass value, and care is taken to en- 
sure proper normalization. 

In the derivation of the PDF, we also assumed 
that there was no relation between Ei{t,es) and 
TTii for background candidates. Candidates in the 
upper sideband are used to calculate v^, which is 
necessary to determine the Fisher discriminant. 
We assume that the calculated vj, is representa- 
tive of all background candidates. To test the 
sensitivity of the lifetime result to the particular 
background sample, we repeat the lifetime fit to 
data but now use candidates with reconstructed 
mass between 5.5 < Mb < 5.7 GeV/c^ to cal- 
culate vt,. There is no change in the fitted life- 
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FIG. 12: A scatter and a profile plot shows the cor- 
relation between the mass and proper decay time of 
candidates in the upper sideband. 



time for data which demonstrates that there is 
no significant relation between Ei(t,es) and rrii 
for background candidates. 



E. Background proper decay time 
parametrization 

To test the reliability of the y{t) parametriza- 
tion described in Sec. I VII 51 we seek an alternate 
parametrization of the data. We use the sum 
of two exponentials convoluted with the detec- 
tor resolution. This parametrization of the back- 
ground is not used in the main fit, as the quality 
of fit to the sideband data is poor. Nonetheless, 
we can generate simulated data where the back- 
ground proper decay times are generated using 
the sum of two exponential functions with mean 
lifetimes of 0.787 ps and 0.0282 ps in the ratio 
1:7.3 as found from a fit to the sideband. This 
results in a background proper decay length dis- 
tribution that has similar characteristics to the 
distribution observed in the upper sideband. We 
fit these simulated data samples with the stan- 
dard PDF. The mean fifetimc residual is 0.0027 
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ps and we take this as an estimate of the system- 
atic uncertainty due to the background proper 
decay time parametrization. 



F. Silicon Alignment 

To determine the uncertainty due to a possible 
misalignment of the silicon (SVX-II) detector we 
consider radial shifts in the silicon layers towards 
and away from the beam pipe, of 50 /xm as has 
been done in other lifetime measurements at the 
CDF experiment, for example, Ref. (sij. The 
shifts in the silicon layers change the measured 
hit positions of the tracks. To first order, the 
mis-measurement in track impact parameters are 
related to a 50 fim shift in the silicon layers by 
the relation d{do) = 50 • sin(a), where a is the 
angle between the track and the perpendicular 
to the silicon layer in the transverse plane. We 
recalculate the measured impact parameters in 
the fast simulation containing the misalignment 
model. The proper decay time of the candidate is 
recalculated using the shifted impact parameter 
values. Fitting 1000 samples generated in this 
way we find a systematic uncertainty in lifetime 
measurements due to silicon layer misalignment 
of 0.0013 ps. 



G. Detector Resolution Model 

In the fit to data, we describe the detec- 
tor time-resolution with a Gaussian of width 
0.0087 ps. We estimate the systematic uncer- 
tainty due to the chosen resolution model by 
generating data sets with an alternative resolu- 
tion function and fitting it using the standard 
PDF. The alternative resolution function is de- 
scribed by a sum of three Gaussians with widths 
0.0067, 0.0124, 0.0249 ps and relative fractions 
1:0.92:0.04. This resolution function derives from 
a study of prompt D mesons combined with an 
extra track from the primary vertex. We test the 
effect of this alternate resolution using the fast 
simulation. From the 1000 samples of fast sim- 
ulated data, we find that the mean residual is 
0.0010 ps and we take this as a systematic uncer- 
tainty. 



H. Signal composition 

We also consider contamination of the signal 
peak by the decay B~ — D^K~ . This decay can 
appear in the sample if the kaon track is recon- 
structed as a pion and the resulting decay passes 



the selection criteria. Although this is the de- 
cay of a charged B meson, the proper decay time 
distribution of this decay mode will be altered 
as the mass has been miscalculated. We use the 
fast simulation to estimate the fraction of — > 
D^K~ candidates that pass the lower mass cut. 
This information, in conjunction with the rela- 
tive branching fractions of the — )• D°7r~ and 
B^ D^K~ decay modes 2J], results in the 
estimate that 3% of the candidates in the signal 
peak are actually misreconstructed B~ — > D^K~ 
decays. This fraction is introduced into the fast 
simulation and the effect on the best fit lifetime 
is negligible. 



I. Summary 

A list of systematic uncertainties is given in Ta- 
ble IIVI Wc combine the uncertainties in quadra- 
ture to find a total systematic uncertainty of 
0.015 ps, which is smaller than the statistical 
uncertainty of 0.023 ps. The leading sources of 
systematic uncertainty are related to the details 
of the SVT single track finding efficiency, and 
the correlation in background between the recon- 
structed proper decay time and mass. Neither of 
these are irreducible and, should the systematic 
uncertainty become a limiting factor in future 
measurements, it should be possible to improve 
them significantly. A more detailed description of 
the SVT track finding efficiency, which can be ob- 
tained from data, can be incorporated into the fit 
to reduce the leading systematic error. Similarly, 
the correlation between the mass and proper de- 
cay time in background candidates can be incor- 
porated into a future version of this technique to 
reduce the second largest contribution. 



X. RESULT AND CONCLUSION 

We introduce a simulation-independent 
method for measuring lifetimes in event samples 
where the selection criteria bias the proper decay 
time distribution. We apply it to measure the 
B^ lifetime in data collected by the hadronic 
B trigger at CDF, which selects events with 
displaced tracks and thus biases the measured 
proper decay time distribution. 

In previous analyses, the trigger bias has been 
corrected for using an efficiency function ob- 
tained from Monte Carlo simulation. This simu- 
lation dependence can be a significant source of 
systematic uncertainty. A recent example is the 
measurement of the A^ lifetime in the hadronic 
decay channel A^ — >■ AcTt at CDF. which found 
r(Ab) = 1.401±0.046 (stat) ±0.035 (syst) ps 
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TABLE IV: Summary of systematic uncertainties. 



Source of systematic uncertainty Uncertainty(ps) 



Track finding efficiency dependence on c?q^ 


0.0103 


Track finding efficiency dependence on pT 


0.0060 


Variation in impact parameter resolution 


0.0020 


Track finding efficiency dependence on r/ 


0.0010 


Mass-proper decay time correlation in background 


0.0083 


Background proper decay time parametrization 


0.0027 


Silicon alignment 


0.0013 


Transformation of Ei{t) to scalar variable 


0.0013 


Detector resolution model 


0.0010 


Signal composition 


negligible 


Total systematic uncertainty 


0.015 



The systematic uncertainty in this measurement 
is almost entirely due to the simulation depen- 
dence. While currently smaller than the sta- 
tistical uncertainty, the systematic uncertainty 
could limit the precision in future, higher statis- 
tics measurements. 

The method introduced here removes the sim- 
ulation dependence by replacing the global effi- 
ciency function with candidate-by-candidate ef- 
ficiency functions that can be calculated ana- 
lytically from the event data, without recourse 
to simulation. We test the method extensively 
with simulated data, and finally apply it to mea- 
sure the lifetime of the B~ meson, t{B~), us- 
ing 23900 ± 200 B- D^tt- candidates, where 
D" K-TT+, collected by CDF's hadronic B 
trigger in 1 fb^^ of data. We extract t{B~) from 
the data without input from simulation. We mea- 
sure t{B-) = 1.663 ± 0.023 (stat) ±0.015 (syst) 
ps. This result is in good agreement with the 
world average of 1.638 ± 0.011 ps This tech- 
nique generalizes easily to other decay channels, 
as we have demonstrated in Sec. IVI Al It can 
be applied to any situation where the trigger or 
other selection criteria bias the proper decay time 
distribution of the reconstructed data. 
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Appendix A: Factorizing the PDF 

This appendix details the factorization of the 
PDF term 'P{s,U,m,,C^,E^{t,es)]T\I) which 
describes the probability that we observe an 
event with given values of t;, m^, Eiit^Eg), 
and Ci. Although there are a number of 
ways to factorize the expression in Eq. (|15p we 
aim to find a final form that includes the fac- 
tor V{U- r|T, E,{t, s) xP(Ci|T, E,{t, e,),t,, s) 
as a parametrization for this factor, given in 
Eq. JTH) is well understood. We make use of the 
following relation 

VIA, B) = V{A)r{B\A) = r{B)r{A\B). (Al) 

Wc only explicitly write the dependence of the 
PDF on the observables. Using Eq. (|A1[) . 
7^(s, ti, TTii, Ci, Es); t[T) can be split into two 
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factors: 

'P{s,ti,mi,Ci,E.i{t,es);T\T) = 
Vis,U,E,{t,es);T\T) (A2) 
X Vim,,a\T,E,{t,es),U,s;T). 

The last factor, V{mi,Ci\T,Ei{t,es),ti,s;T) is 
concerned with the probabihty of observing a 
particular mass and track configuration. This 
can be factorized further, again using Eq. (|A1[) : 

V{mt,Ci\T,Ei{t,es),ti,s;T) = 
rim,\T,E,{t,e,),U,s;T) (A3) 
X Victim,, T,E,{t,es),U,s;T). 

The measured mass is independent of the mean 
lifetime and of the efficiency function and we 
make the assumption that it is independent of the 
measured proper decay time. With these simpli- 
fications we can say that 

V{m,\T,E,{t,es),U,s;T)=V{m,\T,s). (A4) 

The track configuration is indepen- 
dent of the mean lifetime and the mass 
and therefore we can make the simpli- 
fication V{Ci\mi,T, Ei(t,es),ti,s;T) = 
'P{C^\T,E^{t,es),U,s). Substituting this 

and Eq. into Eq. (|A3| leads to 



(A5) 



The remaining factor in Eq. (jA2| is factorized 
further using Eq. (|A1[) : 



'P{mi,Ci\T,Ei{t,es),ti,s;T) = 
V{m,\T,s)r{a\T,E,{t,e,),U,s). 



ris,U,E^{t,es);T\T) = 
ViU;T\T,E^{t,es),s) Vis, E^{t,Ss)\T). 



(A6) 



The first factor on the right hand 
side corresponds to the first factor 
in Eq. ([TH). Applying Eq. one 
more time we find P(s, £'.j(t, es)|T) = 
V{s\E^{t,es),T)riE^{t,es)\T). Therefore, 

V{s,U,E,{t,es);T\T) = 

r{U;T\T,E,{t,es),s) (A7) 
X V{E,{t,e,)\T) Vis\E,{t,e,),T). 



Substitution of Eq. (|A7)) and Eq. (|A5|) into 
Eq. (|X2|) leads to 



V (s, mj, C^,Ei{t, Es)] r|T) = 
riU;T\T,E,{t,es),s) 
X Via\T, E,{t, Es), U, s) r{E,{t, E,)\T) 
xV{m^\T,s)Vis\E,{t,E,),T), 



(A8) 



which is the same as the expression given in 
Eq. (HH) in Sec. IVlIl 

There are other ways to factorize the PDF. 
One of particular interest is a parameteriza- 
tion that depends on the overall signal fraction 
rather than the event-by-event signal probabil- 
ity used here. This can be obtained by replacing 
ris\E,{t,E,),T) withris)riE,{t,E,)\s,T) (and 
equivalently for the background terms). P(s), 
often written as fs, is the overall signal frac- 
tion and P{b) = 1 — P{s) is the background 
fraction. This PDF differs from the one we use 
by an overall factor P{Ei{t,Es)\T) which does 
not affect the maximum of the likelihood func- 
tion. Our choice of PDF is driven by the ease 
of parameterization of the required function. It 
is easier to parameterize the smoothly varying 
candidate-by-candidate signal probability (see 
Fig. [21 P(s|£'i(i, Eg), T)), rather than parame- 
terise the fine structure observed in the fisher 
scalar distribution (see Fig.[TUl P{Ei(t, Es)\s, T)). 



Appendix B: Characterizing the efficiency 
function by a vector of variables 

In order to parametrize the term 
'Pis\E,{t,Es),T), which arises in the PDF 
in Eq. ([TS]) we have chosen to use Fisher 
Discriminant Analysis to characterize each 
Ei{t,Es) by a scalar variable xi (as described 
in Sec. IVIIC2p . To use the Fisher Discrim- 
inant method we need to construct a vector 
Vi, whose components describe the efficiency 
function, Ei{t,Es)- How this vector is obtained 
is described here. 

The vector Vi contains a series of variables: 
vi, V2---, Vn ■ Each variable should describe a 
property of the efficiency function in a way that 
allows comparison of one candidate's efficiency 
function with that of another. In Sec. IV Bl we 
showed that the efficiency function can be written 
as 

E^it,Es) = 



intervals 
in event i 



(Bl) 



where H{es) is either e^, 2£^ — or 363 — 2ef, de- 
pending on whether there were one, two or three 
track pairs that could have passed the trigger. 

From inspection of Ea. (|Bl[) . a single efficiency 
function can be uniquely defined by a series of 



variables that are t„ 



tn 



and the value of 



Hk{Es)- However, this is not a useful description 
for comparing one efficiency function to the next 
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because the number of intervals, and hence the 
number of variables required to describe the efh- 
ciency function, varies from one candidate to the 
next. 

For all candidates the efBciency function is de- 
fined for proper decay times in the range 0-10 
ps. Another way to construct vt would be to bin 
the efficiency into n equal bins of time and take 
the mean value of Hk{es) in each bin as the ele- 
ments vi, V2..., Vn- In this way, the value of any 
particular element of for one candidate can be 
compared to the same element of Vi for another 
candidate. This method is also problematic as 
the is a floating parameter and the Fisher Dis- 
criminant (and, hence, the observable Xi) cannot 
be recalculated at each iteration of the likelihood 
minimization. 

We take an approach that allows construction 
of Vi so that its elements can be compared across 
candidates without requiring knowledge of the 
value of Ss- The efficiency function can be re- 
written in the form 



ground events, v^, can be found from averaging 
the Vi for candidates with 5.37 GeV/c^ > tob, 



el) 



(B2) 



where, 



- E 

J— all intervals 
with 



[9{t — f„iin j) — 9{t — t,nax j)] 



(B3) 

and corresponding terms for At and Ac- The 
value of the functions A at any time are either 
or 1. Writing the efficiency function this way 
splits it into three sections, dependent on the 
value of H{es)- Comparing Aa from one can- 
didate to the next allows comparison of the ef- 
ficiency function arising from the parts where 
there was only one track pair available to pass 
the trigger. To construct Vi we bin each of the 
A functions into 20 bins as a function of proper 
decay time. The value of V1...V20 are the values of 
Aa in each bin. Nominally the value in any given 
bin is either or 1, however, where the efficiency 
turns on or off within the bin an intermediate 
value is taken to represent the mean efficiency 
in that bin. Similarly the values of W2i---W4o are 
the values of Ab in each bin and vh-.-v^q are the 
values of Ac- By splitting the efficiency func- 
tion into three parts, dependent on the form of 
Hk{ss), we have found a vectorial representation 
of the efficiency function that is independent of 
the absolute value of Ss and that allows compari- 
son of Ei{t,es) between different candidates. We 
now have a prescription for converting Ei{t,es) 
into v-i for each candidate. The mean v for back- 



i.e.. 



mB>5.37Gey/c2 



1 



(B4) 



To determine Vg we first determine Vr, which is 
the average of the Vi for events that have mass in 
the range 5.25< hib <5.32 GeV/c^. As this re- 
gion contains both signal and background events, 
Vs can be determined from Vr by subtracting the 
appropriate fraction of Vb- This fraction is deter- 
mined from a fit to the mass distribution. Hav- 
ing determined Vb and Vg, the direction to, and 
therefore Xi can be determined using Fisher Dis- 
criminant Analysis [soj . 



Appendix C: A simpler PDF 

A lot of the complexity of the method pre- 
sented here results directly or indirectly from the 
tight upper impact parameter cut applied by the 
two track trigger. In situations where this upper 
impact parameter cut is significantly looser, or 
ideally where no such cut is applied at all, one 
would not only benefit from a higher statistical 
precision for each candidate [1^, but would also 
be able to employ a significantly simpler version 
of the method as outlined below. In this simpler 
version 

• the dependence on can be removed, 

• under many circumstances, there is no need 
to use the Fisher discriminant. 

While we did not choose this approach for rea- 
sons specific to the CDF II detector trigger (as 
discussed below), it is summarized here for the 
benefit of potential users of this method at other 
experiments. 



Removing the dependence on Es 



As described in Sec. IV B 21 if the track-finding 
efficiency is decay time independent, one can base 
a fit on the PDF given that a certain track com- 
bination has been reconstructed and seen by the 
trigger. Given that a certain track combination 
has been found, the trigger efficiency at a certain 
decay time is either 1 (passes cuts) or (fails), 
independent of Eg- With this, the signal PDF 
given in Eq. ([T^ reduces to: 
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r{t,;T\T,E,{t),s) = 



E 



-til"* 



fcj— all 
intervals 
in event i 



F 



(CI) 



This approach, which is independent of e^, is 
vahd whenever the track-finding efficiency is in- 
dependent of the decay time for all tracks in the 
candidate. Despite the drop of the SVT track 
finding efficiency beyond Irfg*^] > f mm, this ap- 
proach could, in principle, be used in the data 
analyzed in this paper if we applied a fiducial 
cut of |c?o^| < f mm (where the SVT efficiency 
is effectively constant) to all tracks in the de- 
cay (this cut would of course need to be reflected 
in the efficiency function calculation). This is 
a significantly harsher requirement than that of 
the trigger, which requires only two out of three 
tracks to have 0.f2< |c?q^| < f mm, allowing one 
of the tracks to have |(iQ^| > 1mm. We stud- 
ied this option and found that the loss in sta- 
tistical precision due to the additional cut is too 
large^mainly because of the effects discussed in 
Ref. |25| . This simpler approach would however 
be suitable in a situation where the track-finding 
efficiency is constant over a larger range than for 
the SVT. 



with P{b\E,it,es)) = P(6|imi„) = 1 - P{s\tn.in)- 

There is now no need for the Fisher scalar vari- 
able although the PDF term still requires a de- 
scription of the signal fraction as a function of 

Even simpler: Redefining t=0 

Finally, in the case where there is no upper life- 
time cut (i.e. <max = oo), and the lower lifetime 
cut is hard enough to satisfy tmin ^ crt, all the 
"F-tcrms" in Eq. (|Cip are 1, and the equation 
reduces to 

P(t) = le-(*-*""")/" (C2) 
r 

which is equivalent to an event-by-cvent re- 
definition oi t = 0, as used by DELPHI in 
Ref. 1^. 

Appendix D: Full fit results 



Removing the need for a Fisher Discriminant 

If the dependence on has been removed as 
described above, and in addition there is no vari- 
able upper proper decay time cut (no upper im- 
pact parameter cut), the candidatc-by-candidate 
Ei(t) is fully determined by one single parameter, 
the decay time tmin where the acceptance "turns 
on", i.e., above which the decay is accepted. Re- 
membering that the motivation for introducing 
the Fisher discriminant was to translate the effi- 
ciency function into a single number, this would 
clearly be unnecessary, as Ei{t) is already fully 
described by a single number, ^min- The factor 
P{s\Ei{t,es)) can then be replaced by P(s|<min), 



TABLE V: Summary of best fit mass parameters and 
uncertainties. 



Parameter Best fit Uncertainty 



mi [GcV/c2] 
m2 [GcV/c2] 
CTi [GeV/c2] 
as [GeV/c2 ] 
/i 

/a 

a [(GeV/c2)-i 



5.2762 
5.2711 
0.0247 
0.0138 
0.481 
0.741 



±0.0004 
±0.0025 
±0.0033 
±0.0010 

±0.13 
±0.0050 



-0.1658 ±0.0035 
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TABLE VI: Summary of best fit efficiency parame- 
ters and uncertainties. The three periods correspond 
to the changes in the SVT described in Sec. HIT CI 



Efficiency parameter 


Best fit Uncertainty 


Signal Period 1 


0.488 


±0.033 


Signal Period 2 


0.656 


±0.009 


Signal Period 3 
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Background Period 1 
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Background Period 2 
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Background Period 3 
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±0.017 



TABLE VII: Summary of best fit background proper 
decay time parameters and uncertainties. The ct 
value represents the points where the background 
proper decay time distribution is sampled as defined 



by tj in Eq. ( 








Backg. 
Param- 
eter 


ctj (/im) Best fit Uncertainty 


Oi 





10.80 


±0.39 


0-2 


146.9 


7.08 


±0.06 


03 


322.6 
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±0.04 


04 


532.7 


2.73 


±0.04 


05 


783.9 


1.29 


±0.07 


ag 
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as 
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TABLE VIII: Summary of best fit signal fraction, 
and uncertainties, as a function of Fisher scalar. The 
values of Fisher scalar gives the mid point of each 
bin used by the Lagrange interpolating polynomial 
function as described in Sec. IVII C 51 



1 'isher 
param- 
eter 


r isner 
scalar, x 


Best tit 
signal 
fra,ction 


Uncertainty 


Xl 


-8.35 


0.139 


±0.072 


X2 


-7.05 


0.273 


±0.071 


X3 


-6 19 
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+n 030 
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X4, 


-5.32 


0.379 


±0.014 


X5 


-4.45 


0.535 


±0.011 


xe 


-3.59 


0.657 


±0.008 


xr 


-2.73 


0.768 
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